Dataset statistics
| Number of variables | 36 |
|---|---|
| Number of observations | 63578 |
| Missing cells | 614883 |
| Missing cells (%) | 26.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 17.5 MiB |
| Average record size in memory | 288.0 B |
Variable types
| Categorical | 24 |
|---|---|
| Numeric | 7 |
| Unsupported | 5 |
TR6_NO has a high cardinality: 63556 distinct values | High cardinality |
HOUSE_NO has a high cardinality: 4189 distinct values | High cardinality |
STREET_NAME has a high cardinality: 2516 distinct values | High cardinality |
SUBMITTED_ON has a high cardinality: 4461 distinct values | High cardinality |
QEWI_NAME has a high cardinality: 2111 distinct values | High cardinality |
QEWI_BUS_NAME has a high cardinality: 4694 distinct values | High cardinality |
QEWI_BUS_STREET_NAME has a high cardinality: 3901 distinct values | High cardinality |
QEWI_CITY has a high cardinality: 541 distinct values | High cardinality |
QEWI_ZIP has a high cardinality: 233 distinct values | High cardinality |
QEWI_NYS_LIC_NO has a high cardinality: 471 distinct values | High cardinality |
OWNER_NAME has a high cardinality: 6058 distinct values | High cardinality |
OWNER_BUS_NAME has a high cardinality: 26232 distinct values | High cardinality |
FILING_DATE has a high cardinality: 4464 distinct values | High cardinality |
PRIOR_CYCLE_FILING_DATE has a high cardinality: 5274 distinct values | High cardinality |
FIELD_INSPECTION_COMPLETED_DATE has a high cardinality: 5145 distinct values | High cardinality |
QEWI_SIGNED_DATE has a high cardinality: 5198 distinct values | High cardinality |
COMMENTS has a high cardinality: 9038 distinct values | High cardinality |
CONTROL_NO is highly correlated with CYCLE | High correlation |
CYCLE is highly correlated with CONTROL_NO | High correlation |
BIN is highly correlated with BLOCK | High correlation |
BLOCK is highly correlated with BIN | High correlation |
LATE_FILING_AMT is highly correlated with FAILURE_TO_FILE_AMT | High correlation |
FAILURE_TO_FILE_AMT is highly correlated with LATE_FILING_AMT | High correlation |
CONTROL_NO is highly correlated with CYCLE | High correlation |
CYCLE is highly correlated with CONTROL_NO | High correlation |
BIN is highly correlated with BLOCK | High correlation |
BLOCK is highly correlated with BIN | High correlation |
LATE_FILING_AMT is highly correlated with FAILURE_TO_FILE_AMT | High correlation |
FAILURE_TO_FILE_AMT is highly correlated with LATE_FILING_AMT | High correlation |
CONTROL_NO is highly correlated with CYCLE | High correlation |
CYCLE is highly correlated with CONTROL_NO | High correlation |
BIN is highly correlated with BLOCK | High correlation |
BLOCK is highly correlated with BIN | High correlation |
LATE_FILING_AMT is highly correlated with FAILURE_TO_FILE_AMT | High correlation |
FAILURE_TO_FILE_AMT is highly correlated with LATE_FILING_AMT | High correlation |
FILING_STATUS is highly correlated with FILING_TYPE and 1 other fields | High correlation |
FILING_TYPE is highly correlated with FILING_STATUS | High correlation |
CURRENT_STATUS is highly correlated with FILING_STATUS | High correlation |
CONTROL_NO is highly correlated with FILING_TYPE and 3 other fields | High correlation |
FILING_TYPE is highly correlated with CONTROL_NO and 4 other fields | High correlation |
CYCLE is highly correlated with CONTROL_NO and 3 other fields | High correlation |
BIN is highly correlated with BOROUGH and 1 other fields | High correlation |
BOROUGH is highly correlated with BIN | High correlation |
BLOCK is highly correlated with BIN | High correlation |
CURRENT_STATUS is highly correlated with CONTROL_NO and 3 other fields | High correlation |
FILING_STATUS is highly correlated with CONTROL_NO and 3 other fields | High correlation |
PRIOR_STATUS is highly correlated with FILING_TYPE | High correlation |
LATE_FILING_AMT is highly correlated with FAILURE_TO_FILE_AMT | High correlation |
FAILURE_TO_FILE_AMT is highly correlated with LATE_FILING_AMT | High correlation |
SEQUENCE_NO has 2333 (3.7%) missing values | Missing |
SUBMITTED_ON has 12356 (19.4%) missing values | Missing |
QEWI_NAME has 13769 (21.7%) missing values | Missing |
QEWI_BUS_NAME has 14523 (22.8%) missing values | Missing |
QEWI_BUS_STREET_NAME has 12392 (19.5%) missing values | Missing |
QEWI_CITY has 12887 (20.3%) missing values | Missing |
QEWI_STATE has 12395 (19.5%) missing values | Missing |
QEWI_ZIP has 44195 (69.5%) missing values | Missing |
QEWI_NYS_LIC_NO has 44173 (69.5%) missing values | Missing |
OWNER_NAME has 44158 (69.5%) missing values | Missing |
OWNER_BUS_NAME has 11812 (18.6%) missing values | Missing |
OWNER_BUS_STREET_NAME has 63578 (100.0%) missing values | Missing |
OWNER_CITY has 63578 (100.0%) missing values | Missing |
OWNER_ZIP has 63578 (100.0%) missing values | Missing |
OWNER_STATE has 63578 (100.0%) missing values | Missing |
FILING_DATE has 12782 (20.1%) missing values | Missing |
PRIOR_CYCLE_FILING_DATE has 20508 (32.3%) missing values | Missing |
PRIOR_STATUS has 17851 (28.1%) missing values | Missing |
FIELD_INSPECTION_COMPLETED_DATE has 16664 (26.2%) missing values | Missing |
QEWI_SIGNED_DATE has 17763 (27.9%) missing values | Missing |
LATE_FILING_AMT has 1385 (2.2%) missing values | Missing |
FAILURE_TO_FILE_AMT has 1379 (2.2%) missing values | Missing |
FAILURE_TO_COLLECT_AMT has 1200 (1.9%) missing values | Missing |
COMMENTS has 45747 (72.0%) missing values | Missing |
FAILURE_TO_COLLECT_AMT is highly skewed (γ1 = 20.51379133) | Skewed |
TR6_NO is uniformly distributed | Uniform |
SEQUENCE_NO is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
OWNER_BUS_STREET_NAME is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
OWNER_CITY is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
OWNER_ZIP is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
OWNER_STATE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
LATE_FILING_AMT has 19300 (30.4%) zeros | Zeros |
FAILURE_TO_FILE_AMT has 41113 (64.7%) zeros | Zeros |
FAILURE_TO_COLLECT_AMT has 51488 (81.0%) zeros | Zeros |
Reproduction
| Analysis started | 2022-06-30 21:04:25.851923 |
|---|---|
| Analysis finished | 2022-06-30 21:04:50.440029 |
| Duration | 24.59 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 63556 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 496.8 KiB |
| TR6-814993-8A-N1 | 4 |
|---|---|
| TR6-610070-NA-I1 | 4 |
| TR6-815008-8B-N1 | 4 |
| TR6-812231-8B-N1 | 4 |
| TR6-815008-8B-I2 | 2 |
| Other values (63551) |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 16.00003146 |
| Min length | 16 |
Characters and Unicode
| Total characters | 1017250 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 63542 ? |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | TR6-913448-9A-N1 |
|---|---|
| 2nd row | TR6-913451-9A-N1 |
| 3rd row | TR6-913456-9A-N1 |
| 4th row | TR6-913458-9A-N1 |
| 5th row | TR6-913460-9A-N1 |
Common Values
| Value | Count | Frequency (%) |
| TR6-814993-8A-N1 | 4 | < 0.1% |
| TR6-610070-NA-I1 | 4 | < 0.1% |
| TR6-815008-8B-N1 | 4 | < 0.1% |
| TR6-812231-8B-N1 | 4 | < 0.1% |
| TR6-815008-8B-I2 | 2 | < 0.1% |
| TR6-613144-NA-N1 | 2 | < 0.1% |
| TR6-601435-NA-I1 | 2 | < 0.1% |
| TR6-800351-8A-S1 | 2 | < 0.1% |
| TR6-613144-NA-I1 | 2 | < 0.1% |
| TR6-613144-NA-A1 | 2 | < 0.1% |
| Other values (63546) | 63550 |
Length
| Value | Count | Frequency (%) |
| tr6-814993-8a-n1 | 4 | < 0.1% |
| tr6-815008-8b-n1 | 4 | < 0.1% |
| tr6-812231-8b-n1 | 4 | < 0.1% |
| tr6-610070-na-i1 | 4 | < 0.1% |
| tr6-613144-na-a1 | 2 | < 0.1% |
| tr6-815013-8b-i1 | 2 | < 0.1% |
| tr6-613144-na-s1 | 2 | < 0.1% |
| tr6-812231-8b-i1 | 2 | < 0.1% |
| tr6-601435-na-n1 | 2 | < 0.1% |
| tr6-613144-na-i1 | 2 | < 0.1% |
| Other values (63546) | 63550 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 190734 | |
| 1 | 109905 | |
| 6 | 103851 | |
| 0 | 71191 | 7.0% |
| 8 | 67294 | 6.6% |
| T | 63578 | 6.2% |
| R | 63578 | 6.2% |
| 7 | 55220 | 5.4% |
| A | 46084 | 4.5% |
| 9 | 44317 | 4.4% |
| Other values (9) | 201498 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 556239 | |
| Uppercase Letter | 270277 | |
| Dash Punctuation | 190734 | 18.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 109905 | |
| 6 | 103851 | |
| 0 | 71191 | |
| 8 | 67294 | |
| 7 | 55220 | |
| 9 | 44317 | |
| 2 | 29007 | 5.2% |
| 3 | 26478 | 4.8% |
| 4 | 25288 | 4.5% |
| 5 | 23688 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 63578 | |
| R | 63578 | |
| A | 46084 | |
| I | 43361 | |
| N | 28292 | |
| B | 12314 | 4.6% |
| C | 11743 | 4.3% |
| S | 1327 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 190734 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 746973 | |
| Latin | 270277 | 26.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 190734 | |
| 1 | 109905 | |
| 6 | 103851 | |
| 0 | 71191 | 9.5% |
| 8 | 67294 | 9.0% |
| 7 | 55220 | 7.4% |
| 9 | 44317 | 5.9% |
| 2 | 29007 | 3.9% |
| 3 | 26478 | 3.5% |
| 4 | 25288 | 3.4% |
Latin
| Value | Count | Frequency (%) |
| T | 63578 | |
| R | 63578 | |
| A | 46084 | |
| I | 43361 | |
| N | 28292 | |
| B | 12314 | 4.6% |
| C | 11743 | 4.3% |
| S | 1327 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1017250 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 190734 | |
| 1 | 109905 | |
| 6 | 103851 | |
| 0 | 71191 | 7.0% |
| 8 | 67294 | 6.6% |
| T | 63578 | 6.2% |
| R | 63578 | 6.2% |
| 7 | 55220 | 5.4% |
| A | 46084 | 4.5% |
| 9 | 44317 | 4.4% |
| Other values (9) | 201498 |
| Distinct | 49000 |
|---|---|
| Distinct (%) | 77.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 749000.7584 |
| Minimum | 600001 |
|---|---|
| Maximum | 919118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 496.8 KiB |
Quantile statistics
| Minimum | 600001 |
|---|---|
| 5-th percentile | 602634.85 |
| Q1 | 613118.25 |
| median | 800117.5 |
| Q3 | 811863 |
| 95-th percentile | 911902.15 |
| Maximum | 919118 |
| Range | 319117 |
| Interquartile range (IQR) | 198744.75 |
Descriptive statistics
| Standard deviation | 104353.8707 |
|---|---|
| Coefficient of variation (CV) | 0.1393241189 |
| Kurtosis | -1.183629215 |
| Mean | 749000.7584 |
| Median Absolute Deviation (MAD) | 95285 |
| Skewness | -0.006453664325 |
| Sum | 4.761997022 × 1010 |
| Variance | 1.088973034 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 613144 | 8 | < 0.1% |
| 814901 | 6 | < 0.1% |
| 812231 | 6 | < 0.1% |
| 814903 | 6 | < 0.1% |
| 815008 | 6 | < 0.1% |
| 916004 | 5 | < 0.1% |
| 816552 | 5 | < 0.1% |
| 802224 | 5 | < 0.1% |
| 801989 | 5 | < 0.1% |
| 807142 | 5 | < 0.1% |
| Other values (48990) | 63521 |
| Value | Count | Frequency (%) |
| 600001 | 1 | |
| 600003 | 1 | |
| 600004 | 1 | |
| 600005 | 1 | |
| 600006 | 1 | |
| 600007 | 1 | |
| 600008 | 1 | |
| 600009 | 1 | |
| 600010 | 1 | |
| 600011 | 1 |
| Value | Count | Frequency (%) |
| 919118 | 1 | |
| 919116 | 1 | |
| 919115 | 1 | |
| 919114 | 1 | |
| 919113 | 1 | |
| 919110 | 1 | |
| 919109 | 2 | |
| 919108 | 1 | |
| 919106 | 1 | |
| 919105 | 1 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 496.8 KiB |
| Initial | |
|---|---|
| Auto-Generated | |
| Amended | |
| Subsequent | 1327 |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 8.419830759 |
| Min length | 7 |
Characters and Unicode
| Total characters | 535316 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Auto-Generated |
|---|---|
| 2nd row | Auto-Generated |
| 3rd row | Auto-Generated |
| 4th row | Auto-Generated |
| 5th row | Auto-Generated |
Common Values
| Value | Count | Frequency (%) |
| Initial | 43361 | |
| Auto-Generated | 12327 | 19.4% |
| Amended | 6563 | 10.3% |
| Subsequent | 1327 | 2.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| initial | 43361 | |
| auto-generated | 12327 | 19.4% |
| amended | 6563 | 10.3% |
| subsequent | 1327 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 86722 | |
| t | 69342 | |
| n | 63578 | |
| a | 55688 | |
| e | 52761 | |
| I | 43361 | |
| l | 43361 | |
| d | 25453 | 4.8% |
| A | 18890 | 3.5% |
| u | 14981 | 2.8% |
| Other values (9) | 61179 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 447084 | |
| Uppercase Letter | 75905 | 14.2% |
| Dash Punctuation | 12327 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 86722 | |
| t | 69342 | |
| n | 63578 | |
| a | 55688 | |
| e | 52761 | |
| l | 43361 | |
| d | 25453 | 5.7% |
| u | 14981 | 3.4% |
| r | 12327 | 2.8% |
| o | 12327 | 2.8% |
| Other values (4) | 10544 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 43361 | |
| A | 18890 | |
| G | 12327 | 16.2% |
| S | 1327 | 1.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12327 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 522989 | |
| Common | 12327 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 86722 | |
| t | 69342 | |
| n | 63578 | |
| a | 55688 | |
| e | 52761 | |
| I | 43361 | |
| l | 43361 | |
| d | 25453 | 4.9% |
| A | 18890 | 3.6% |
| u | 14981 | 2.9% |
| Other values (8) | 48852 |
Common
| Value | Count | Frequency (%) |
| - | 12327 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 535316 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 86722 | |
| t | 69342 | |
| n | 63578 | |
| a | 55688 | |
| e | 52761 | |
| I | 43361 | |
| l | 43361 | |
| d | 25453 | 4.8% |
| A | 18890 | 3.5% |
| u | 14981 | 2.8% |
| Other values (9) | 61179 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 496.8 KiB |
| 8 | |
|---|---|
| 6 | |
| 7 | |
| 9 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 63578 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 9 |
| 3rd row | 9 |
| 4th row | 9 |
| 5th row | 9 |
Common Values
| Value | Count | Frequency (%) |
| 8 | 21555 | |
| 6 | 15965 | |
| 7 | 15662 | |
| 9 | 10396 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 8 | 21555 | |
| 6 | 15965 | |
| 7 | 15662 | |
| 9 | 10396 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 21555 | |
| 6 | 15965 | |
| 7 | 15662 | |
| 9 | 10396 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 63578 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 21555 | |
| 6 | 15965 | |
| 7 | 15662 | |
| 9 | 10396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 63578 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 21555 | |
| 6 | 15965 | |
| 7 | 15662 | |
| 9 | 10396 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63578 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 21555 | |
| 6 | 15965 | |
| 7 | 15662 | |
| 9 | 10396 |
| Distinct | 15680 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 28 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1940923.106 |
| Minimum | 1000000 |
|---|---|
| Maximum | 5863301 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 496.8 KiB |
Quantile statistics
| Minimum | 1000000 |
|---|---|
| 5-th percentile | 1007979.05 |
| Q1 | 1035344.5 |
| median | 1081661 |
| Q3 | 3110168 |
| 95-th percentile | 4432026 |
| Maximum | 5863301 |
| Range | 4863301 |
| Interquartile range (IQR) | 2074823.5 |
Descriptive statistics
| Standard deviation | 1219255.774 |
|---|---|
| Coefficient of variation (CV) | 0.6281834505 |
| Kurtosis | -0.5214484333 |
| Mean | 1940923.106 |
| Median Absolute Deviation (MAD) | 68100 |
| Skewness | 0.9819193684 |
| Sum | 1.233456634 × 1011 |
| Variance | 1.486584643 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1084781 | 38 | 0.1% |
| 1000000 | 25 | < 0.1% |
| 3335934 | 20 | < 0.1% |
| 1077591 | 19 | < 0.1% |
| 3253907 | 19 | < 0.1% |
| 1077585 | 18 | < 0.1% |
| 1081661 | 17 | < 0.1% |
| 1088305 | 16 | < 0.1% |
| 3345581 | 16 | < 0.1% |
| 1087284 | 16 | < 0.1% |
| Other values (15670) | 63346 | |
| (Missing) | 28 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000000 | 25 | |
| 1000005 | 5 | < 0.1% |
| 1000006 | 6 | < 0.1% |
| 1000007 | 5 | < 0.1% |
| 1000016 | 6 | < 0.1% |
| 1000018 | 4 | < 0.1% |
| 1000020 | 4 | < 0.1% |
| 1000021 | 5 | < 0.1% |
| 1000023 | 5 | < 0.1% |
| 1000024 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 5863301 | 1 | < 0.1% |
| 5160021 | 6 | |
| 5158679 | 3 | |
| 5158313 | 5 | |
| 5157567 | 6 | |
| 5157402 | 4 | |
| 5156898 | 1 | < 0.1% |
| 5150768 | 1 | < 0.1% |
| 5141912 | 1 | < 0.1% |
| 5122638 | 5 |
| Distinct | 4189 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 496.8 KiB |
| 1 | 364 |
|---|---|
| 30 | 342 |
| 50 | 342 |
| 40 | 331 |
| 200 | 326 |
| Other values (4184) |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 3.25751046 |
| Min length | 1 |
Characters and Unicode
| Total characters | 207106 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 251 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 143-45 |
|---|---|
| 2nd row | 15 |
| 3rd row | 180 |
| 4th row | 41-46 |
| 5th row | 220 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 364 | 0.6% |
| 30 | 342 | 0.5% |
| 50 | 342 | 0.5% |
| 40 | 331 | 0.5% |
| 200 | 326 | 0.5% |
| 60 | 320 | 0.5% |
| 100 | 317 | 0.5% |
| 20 | 297 | 0.5% |
| 15 | 280 | 0.4% |
| 150 | 279 | 0.4% |
| Other values (4179) | 60380 |
Length
| Value | Count | Frequency (%) |
| 1 | 364 | 0.6% |
| 30 | 342 | 0.5% |
| 50 | 342 | 0.5% |
| 40 | 331 | 0.5% |
| 200 | 326 | 0.5% |
| 60 | 320 | 0.5% |
| 100 | 317 | 0.5% |
| 20 | 297 | 0.5% |
| 15 | 280 | 0.4% |
| 150 | 279 | 0.4% |
| Other values (4178) | 60384 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 37633 | |
| 0 | 28034 | |
| 2 | 25441 | |
| 5 | 23695 | |
| 3 | 20974 | |
| 4 | 18270 | |
| 6 | 13197 | 6.4% |
| 7 | 11531 | 5.6% |
| 8 | 10920 | 5.3% |
| 9 | 9912 | 4.8% |
| Other values (3) | 7499 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 199607 | |
| Dash Punctuation | 7491 | 3.6% |
| Space Separator | 4 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 37633 | |
| 0 | 28034 | |
| 2 | 25441 | |
| 5 | 23695 | |
| 3 | 20974 | |
| 4 | 18270 | |
| 6 | 13197 | 6.6% |
| 7 | 11531 | 5.8% |
| 8 | 10920 | 5.5% |
| 9 | 9912 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7491 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 207102 | |
| Latin | 4 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 37633 | |
| 0 | 28034 | |
| 2 | 25441 | |
| 5 | 23695 | |
| 3 | 20974 | |
| 4 | 18270 | |
| 6 | 13197 | 6.4% |
| 7 | 11531 | 5.6% |
| 8 | 10920 | 5.3% |
| 9 | 9912 | 4.8% |
| Other values (2) | 7495 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| A | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 207106 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 37633 | |
| 0 | 28034 | |
| 2 | 25441 | |
| 5 | 23695 | |
| 3 | 20974 | |
| 4 | 18270 | |
| 6 | 13197 | 6.4% |
| 7 | 11531 | 5.6% |
| 8 | 10920 | 5.3% |
| 9 | 9912 | 4.8% |
| Other values (3) | 7499 | 3.6% |
| Distinct | 2516 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 496.8 KiB |
| BROADWAY | 1820 |
|---|---|
| FIFTH AVENUE | 1247 |
| PARK AVENUE | 975 |
| MADISON AVENUE | 815 |
| RIVERSIDE DRIVE | 661 |
| Other values (2511) |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 13.6754695 |
| Min length | 6 |
Characters and Unicode
| Total characters | 869459 |
|---|---|
| Distinct characters | 67 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 143 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | SANFORD AVENUE |
|---|---|
| 2nd row | OLIVER STREET |
| 3rd row | ELDRIDGE STREET |
| 4th row | 50 STREET |
| 5th row | EAST 19 STREET |
Common Values
| Value | Count | Frequency (%) |
| BROADWAY | 1820 | 2.9% |
| FIFTH AVENUE | 1247 | 2.0% |
| PARK AVENUE | 975 | 1.5% |
| MADISON AVENUE | 815 | 1.3% |
| RIVERSIDE DRIVE | 661 | 1.0% |
| WEST END AVENUE | 641 | 1.0% |
| LEXINGTON AVENUE | 543 | 0.9% |
| THIRD AVENUE | 481 | 0.8% |
| SECOND AVENUE | 392 | 0.6% |
| 7 AVENUE | 376 | 0.6% |
| Other values (2506) | 55627 |
Length
| Value | Count | Frequency (%) |
| street | 31448 | |
| avenue | 20726 | 13.6% |
| west | 12301 | 8.1% |
| east | 10657 | 7.0% |
| park | 1962 | 1.3% |
| broadway | 1944 | 1.3% |
| boulevard | 1804 | 1.2% |
| place | 1413 | 0.9% |
| road | 1382 | 0.9% |
| drive | 1296 | 0.8% |
| Other values (1462) | 67680 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 156557 | |
| T | 103075 | |
| 92144 | ||
| S | 69441 | 8.0% |
| A | 62986 | 7.2% |
| R | 61480 | 7.1% |
| N | 43496 | 5.0% |
| U | 28797 | 3.3% |
| V | 26947 | 3.1% |
| O | 25180 | 2.9% |
| Other values (57) | 199356 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 717355 | |
| Space Separator | 92144 | 10.6% |
| Decimal Number | 58829 | 6.8% |
| Lowercase Letter | 1047 | 0.1% |
| Other Punctuation | 55 | < 0.1% |
| Dash Punctuation | 17 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 156557 | |
| T | 103075 | |
| S | 69441 | |
| A | 62986 | |
| R | 61480 | 8.6% |
| N | 43496 | 6.1% |
| U | 28797 | 4.0% |
| V | 26947 | 3.8% |
| O | 25180 | 3.5% |
| W | 19805 | 2.8% |
| Other values (16) | 119591 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 237 | |
| t | 176 | |
| r | 97 | |
| n | 81 | 7.7% |
| s | 78 | 7.4% |
| a | 58 | 5.5% |
| u | 51 | 4.9% |
| o | 42 | 4.0% |
| v | 40 | 3.8% |
| d | 34 | 3.2% |
| Other values (13) | 153 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 11743 | |
| 2 | 6859 | |
| 3 | 6229 | |
| 7 | 5893 | |
| 5 | 5459 | |
| 4 | 5193 | |
| 8 | 5075 | |
| 6 | 4920 | |
| 9 | 3926 | 6.7% |
| 0 | 3532 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26 | |
| # | 14 | |
| ' | 11 | |
| & | 4 | 7.3% |
Space Separator
| Value | Count | Frequency (%) |
| 92144 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 718402 | |
| Common | 151057 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 156557 | |
| T | 103075 | |
| S | 69441 | |
| A | 62986 | |
| R | 61480 | 8.6% |
| N | 43496 | 6.1% |
| U | 28797 | 4.0% |
| V | 26947 | 3.8% |
| O | 25180 | 3.5% |
| W | 19805 | 2.8% |
| Other values (39) | 120638 |
Common
| Value | Count | Frequency (%) |
| 92144 | ||
| 1 | 11743 | 7.8% |
| 2 | 6859 | 4.5% |
| 3 | 6229 | 4.1% |
| 7 | 5893 | 3.9% |
| 5 | 5459 | 3.6% |
| 4 | 5193 | 3.4% |
| 8 | 5075 | 3.4% |
| 6 | 4920 | 3.3% |
| 9 | 3926 | 2.6% |
| Other values (8) | 3616 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 869459 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 156557 | |
| T | 103075 | |
| 92144 | ||
| S | 69441 | 8.0% |
| A | 62986 | 7.2% |
| R | 61480 | 7.1% |
| N | 43496 | 5.0% |
| U | 28797 | 3.3% |
| V | 26947 | 3.1% |
| O | 25180 | 2.9% |
| Other values (57) | 199356 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 496.8 KiB |
| MANHATTAN | |
|---|---|
| BROOKLYN | |
| BRONX | |
| QUEENS | |
| STATEN ISLAND | 617 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.982210828 |
| Min length | 5 |
Characters and Unicode
| Total characters | 507493 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | QUEENS |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | MANHATTAN |
| 4th row | QUEENS |
| 5th row | MANHATTAN |
Common Values
| Value | Count | Frequency (%) |
| MANHATTAN | 37205 | |
| BROOKLYN | 9332 | 14.7% |
| BRONX | 8573 | 13.5% |
| QUEENS | 7851 | 12.3% |
| STATEN ISLAND | 617 | 1.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| manhattan | 37205 | |
| brooklyn | 9332 | 14.5% |
| bronx | 8573 | 13.4% |
| queens | 7851 | 12.2% |
| staten | 617 | 1.0% |
| island | 617 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 112849 | |
| N | 101400 | |
| T | 75644 | |
| M | 37205 | 7.3% |
| H | 37205 | 7.3% |
| O | 27237 | 5.4% |
| B | 17905 | 3.5% |
| R | 17905 | 3.5% |
| E | 16319 | 3.2% |
| L | 9949 | 2.0% |
| Other values (9) | 53875 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 506876 | |
| Space Separator | 617 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 112849 | |
| N | 101400 | |
| T | 75644 | |
| M | 37205 | 7.3% |
| H | 37205 | 7.3% |
| O | 27237 | 5.4% |
| B | 17905 | 3.5% |
| R | 17905 | 3.5% |
| E | 16319 | 3.2% |
| L | 9949 | 2.0% |
| Other values (8) | 53258 |
Space Separator
| Value | Count | Frequency (%) |
| 617 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 506876 | |
| Common | 617 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 112849 | |
| N | 101400 | |
| T | 75644 | |
| M | 37205 | 7.3% |
| H | 37205 | 7.3% |
| O | 27237 | 5.4% |
| B | 17905 | 3.5% |
| R | 17905 | 3.5% |
| E | 16319 | 3.2% |
| L | 9949 | 2.0% |
| Other values (8) | 53258 |
Common
| Value | Count | Frequency (%) |
| 617 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 507493 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 112849 | |
| N | 101400 | |
| T | 75644 | |
| M | 37205 | 7.3% |
| H | 37205 | 7.3% |
| O | 27237 | 5.4% |
| B | 17905 | 3.5% |
| R | 17905 | 3.5% |
| E | 16319 | 3.2% |
| L | 9949 | 2.0% |
| Other values (9) | 53875 |
| Distinct | 3678 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2347.533581 |
| Minimum | 1 |
|---|---|
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 496.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 199 |
| Q1 | 864.25 |
| median | 1505 |
| Q3 | 2830 |
| 95-th percentile | 7242 |
| Maximum | 99999 |
| Range | 99998 |
| Interquartile range (IQR) | 1965.75 |
Descriptive statistics
| Standard deviation | 2702.322236 |
|---|---|
| Coefficient of variation (CV) | 1.151132516 |
| Kurtosis | 219.8598793 |
| Mean | 2347.533581 |
| Median Absolute Deviation (MAD) | 710 |
| Skewness | 7.927977306 |
| Sum | 149251490 |
| Variance | 7302545.468 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3944 | 241 | 0.4% |
| 16 | 219 | 0.3% |
| 2179 | 187 | 0.3% |
| 4905 | 169 | 0.3% |
| 2180 | 162 | 0.3% |
| 8329 | 156 | 0.2% |
| 4452 | 155 | 0.2% |
| 2139 | 147 | 0.2% |
| 3943 | 145 | 0.2% |
| 1344 | 130 | 0.2% |
| Other values (3668) | 61867 |
| Value | Count | Frequency (%) |
| 1 | 39 | |
| 3 | 9 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 24 | |
| 6 | 13 | < 0.1% |
| 8 | 25 | |
| 9 | 13 | < 0.1% |
| 10 | 23 | |
| 11 | 17 | |
| 13 | 24 |
| Value | Count | Frequency (%) |
| 99999 | 8 | |
| 16234 | 7 | |
| 16233 | 5 | |
| 16231 | 5 | |
| 16230 | 10 | |
| 16229 | 6 | |
| 16228 | 1 | < 0.1% |
| 16227 | 3 | < 0.1% |
| 16226 | 10 | |
| 16186 | 4 | < 0.1% |
LOT
Real number (ℝ≥0)
| Distinct | 446 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1128.72319 |
| Minimum | 0 |
|---|---|
| Maximum | 9100 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 496.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 31 |
| Q3 | 71 |
| 95-th percentile | 7502 |
| Maximum | 9100 |
| Range | 9100 |
| Interquartile range (IQR) | 64 |
Descriptive statistics
| Standard deviation | 2632.747633 |
|---|---|
| Coefficient of variation (CV) | 2.332500701 |
| Kurtosis | 2.061266009 |
| Mean | 1128.72319 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 2.011999916 |
| Sum | 71761963 |
| Variance | 6931360.099 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 12066 | 19.0% |
| 7501 | 5021 | 7.9% |
| 7502 | 1958 | 3.1% |
| 2 | 1103 | 1.7% |
| 10 | 953 | 1.5% |
| 7503 | 950 | 1.5% |
| 20 | 905 | 1.4% |
| 29 | 884 | 1.4% |
| 21 | 856 | 1.3% |
| 15 | 799 | 1.3% |
| Other values (436) | 38083 |
| Value | Count | Frequency (%) |
| 0 | 3 | < 0.1% |
| 1 | 12066 | |
| 2 | 1103 | 1.7% |
| 3 | 497 | 0.8% |
| 4 | 372 | 0.6% |
| 5 | 717 | 1.1% |
| 6 | 588 | 0.9% |
| 7 | 767 | 1.2% |
| 8 | 611 | 1.0% |
| 9 | 554 | 0.9% |
| Value | Count | Frequency (%) |
| 9100 | 3 | < 0.1% |
| 9080 | 13 | |
| 9078 | 6 | < 0.1% |
| 9059 | 8 | < 0.1% |
| 9029 | 6 | < 0.1% |
| 9021 | 1 | < 0.1% |
| 9020 | 5 | < 0.1% |
| 9010 | 3 | < 0.1% |
| 9005 | 4 | < 0.1% |
| 9001 | 23 |
| Distinct | 4461 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 12356 |
| Missing (%) | 19.4% |
| Memory size | 496.8 KiB |
| 2007-02-21 00:00:00 | 1215 |
|---|---|
| 2007-02-20 00:00:00 | 733 |
| 2012-02-21 00:00:00 | 642 |
| 2017-02-21 00:00:00 | 583 |
| 2022-02-21 00:00:00 | 511 |
| Other values (4456) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 973218 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 441 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 2012-02-21 00:00:00 |
|---|---|
| 2nd row | 2012-11-07 00:00:00 |
| 3rd row | 2012-03-26 00:00:00 |
| 4th row | 2012-08-20 00:00:00 |
| 5th row | 2011-11-10 00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2007-02-21 00:00:00 | 1215 | 1.9% |
| 2007-02-20 00:00:00 | 733 | 1.2% |
| 2012-02-21 00:00:00 | 642 | 1.0% |
| 2017-02-21 00:00:00 | 583 | 0.9% |
| 2022-02-21 00:00:00 | 511 | 0.8% |
| 2022-02-18 00:00:00 | 446 | 0.7% |
| 2007-02-16 00:00:00 | 408 | 0.6% |
| 2018-02-21 00:00:00 | 405 | 0.6% |
| 2019-02-21 00:00:00 | 367 | 0.6% |
| 2012-08-21 00:00:00 | 351 | 0.6% |
| Other values (4451) | 45561 | |
| (Missing) | 12356 | 19.4% |
Length
| Value | Count | Frequency (%) |
| 00:00:00 | 51055 | |
| 2007-02-21 | 1215 | 1.2% |
| 2007-02-20 | 733 | 0.7% |
| 2012-02-21 | 642 | 0.6% |
| 2017-02-21 | 583 | 0.6% |
| 2022-02-21 | 511 | 0.5% |
| 2022-02-18 | 446 | 0.4% |
| 2007-02-16 | 408 | 0.4% |
| 2018-02-21 | 405 | 0.4% |
| 2019-02-21 | 367 | 0.4% |
| Other values (4453) | 46079 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 437086 | |
| 2 | 111981 | 11.5% |
| - | 102444 | 10.5% |
| : | 102444 | 10.5% |
| 1 | 78887 | 8.1% |
| 51222 | 5.3% | |
| 7 | 18387 | 1.9% |
| 8 | 14136 | 1.5% |
| 3 | 13435 | 1.4% |
| 6 | 12560 | 1.3% |
| Other values (3) | 30636 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 717108 | |
| Dash Punctuation | 102444 | 10.5% |
| Other Punctuation | 102444 | 10.5% |
| Space Separator | 51222 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 437086 | |
| 2 | 111981 | 15.6% |
| 1 | 78887 | 11.0% |
| 7 | 18387 | 2.6% |
| 8 | 14136 | 2.0% |
| 3 | 13435 | 1.9% |
| 6 | 12560 | 1.8% |
| 9 | 12324 | 1.7% |
| 5 | 10077 | 1.4% |
| 4 | 8235 | 1.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 102444 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 102444 |
Space Separator
| Value | Count | Frequency (%) |
| 51222 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 973218 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 437086 | |
| 2 | 111981 | 11.5% |
| - | 102444 | 10.5% |
| : | 102444 | 10.5% |
| 1 | 78887 | 8.1% |
| 51222 | 5.3% | |
| 7 | 18387 | 1.9% |
| 8 | 14136 | 1.5% |
| 3 | 13435 | 1.4% |
| 6 | 12560 | 1.3% |
| Other values (3) | 30636 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 973218 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 437086 | |
| 2 | 111981 | 11.5% |
| - | 102444 | 10.5% |
| : | 102444 | 10.5% |
| 1 | 78887 | 8.1% |
| 51222 | 5.3% | |
| 7 | 18387 | 1.9% |
| 8 | 14136 | 1.5% |
| 3 | 13435 | 1.4% |
| 6 | 12560 | 1.3% |
| Other values (3) | 30636 | 3.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 271 |
| Missing (%) | 0.4% |
| Memory size | 496.8 KiB |
| SAFE | |
|---|---|
| SWARMP | |
| No Report Filed | |
| UNSAFE |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 6.157075837 |
| Min length | 4 |
Characters and Unicode
| Total characters | 389786 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Report Filed |
|---|---|
| 2nd row | UNSAFE |
| 3rd row | No Report Filed |
| 4th row | No Report Filed |
| 5th row | SAFE |
Common Values
| Value | Count | Frequency (%) |
| SAFE | 29138 | |
| SWARMP | 21423 | |
| No Report Filed | 7580 | 11.9% |
| UNSAFE | 5166 | 8.1% |
| (Missing) | 271 | 0.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| safe | 29138 | |
| swarmp | 21423 | |
| no | 7580 | 9.7% |
| report | 7580 | 9.7% |
| filed | 7580 | 9.7% |
| unsafe | 5166 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 55727 | |
| A | 55727 | |
| F | 41884 | |
| E | 34304 | |
| R | 29003 | 7.4% |
| W | 21423 | 5.5% |
| M | 21423 | 5.5% |
| P | 21423 | 5.5% |
| 15160 | 3.9% | |
| e | 15160 | 3.9% |
| Other values (9) | 78552 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 298826 | |
| Lowercase Letter | 75800 | 19.4% |
| Space Separator | 15160 | 3.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 55727 | |
| A | 55727 | |
| F | 41884 | |
| E | 34304 | |
| R | 29003 | |
| W | 21423 | 7.2% |
| M | 21423 | 7.2% |
| P | 21423 | 7.2% |
| N | 12746 | 4.3% |
| U | 5166 | 1.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15160 | |
| o | 15160 | |
| p | 7580 | |
| r | 7580 | |
| t | 7580 | |
| i | 7580 | |
| l | 7580 | |
| d | 7580 |
Space Separator
| Value | Count | Frequency (%) |
| 15160 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 374626 | |
| Common | 15160 | 3.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 55727 | |
| A | 55727 | |
| F | 41884 | |
| E | 34304 | |
| R | 29003 | |
| W | 21423 | 5.7% |
| M | 21423 | 5.7% |
| P | 21423 | 5.7% |
| e | 15160 | 4.0% |
| o | 15160 | 4.0% |
| Other values (8) | 63392 |
Common
| Value | Count | Frequency (%) |
| 15160 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 389786 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 55727 | |
| A | 55727 | |
| F | 41884 | |
| E | 34304 | |
| R | 29003 | 7.4% |
| W | 21423 | 5.5% |
| M | 21423 | 5.5% |
| P | 21423 | 5.5% |
| 15160 | 3.9% | |
| e | 15160 | 3.9% |
| Other values (9) | 78552 |
| Distinct | 2111 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 13769 |
| Missing (%) | 21.7% |
| Memory size | 496.8 KiB |
| PAUL MILLMAN | 1337 |
|---|---|
| ALAN S EPSTEIN | 1139 |
| HOWARD L ZIMMERMAN | 897 |
| TIMOTHY WEBB | 696 |
| ANTHONY STASIO | 695 |
| Other values (2106) |
Length
| Max length | 27 |
|---|---|
| Median length | 24 |
| Mean length | 14.42695095 |
| Min length | 1 |
Characters and Unicode
| Total characters | 718592 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 846 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | PAUL MILLMAN |
|---|---|
| 2nd row | JAMES CICALO |
| 3rd row | JAMES MODDY |
| 4th row | CHARLES A MERRITT |
| 5th row | STANFORD CHAN |
Common Values
| Value | Count | Frequency (%) |
| PAUL MILLMAN | 1337 | 2.1% |
| ALAN S EPSTEIN | 1139 | 1.8% |
| HOWARD L ZIMMERMAN | 897 | 1.4% |
| TIMOTHY WEBB | 696 | 1.1% |
| ANTHONY STASIO | 695 | 1.1% |
| HOWARD ZIMMERMAN | 689 | 1.1% |
| BARIS ACAR | 665 | 1.0% |
| CHARLES A MERRITT | 623 | 1.0% |
| DAVID SALAMON | 610 | 1.0% |
| JOSEPH CANTON | 563 | 0.9% |
| Other values (2101) | 41895 | |
| (Missing) | 13769 | 21.7% |
Length
| Value | Count | Frequency (%) |
| j | 2274 | 2.0% |
| s | 2081 | 1.8% |
| a | 2015 | 1.8% |
| alan | 1796 | 1.6% |
| l | 1793 | 1.6% |
| robert | 1708 | 1.5% |
| howard | 1622 | 1.4% |
| zimmerman | 1597 | 1.4% |
| epstein | 1575 | 1.4% |
| paul | 1522 | 1.3% |
| Other values (1772) | 95686 |
Most occurring characters
| Value | Count | Frequency (%) |
| 97197 | ||
| A | 76681 | 10.7% |
| E | 60423 | 8.4% |
| N | 51612 | 7.2% |
| R | 49037 | 6.8% |
| I | 45841 | 6.4% |
| L | 39414 | 5.5% |
| O | 38168 | 5.3% |
| S | 35377 | 4.9% |
| M | 30123 | 4.2% |
| Other values (45) | 194719 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 620310 | |
| Space Separator | 97197 | 13.5% |
| Other Punctuation | 717 | 0.1% |
| Dash Punctuation | 264 | < 0.1% |
| Lowercase Letter | 64 | < 0.1% |
| Decimal Number | 33 | < 0.1% |
| Modifier Symbol | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 76681 | |
| E | 60423 | 9.7% |
| N | 51612 | 8.3% |
| R | 49037 | 7.9% |
| I | 45841 | 7.4% |
| L | 39414 | 6.4% |
| O | 38168 | 6.2% |
| S | 35377 | 5.7% |
| M | 30123 | 4.9% |
| T | 28739 | 4.6% |
| Other values (16) | 164895 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10 | |
| l | 8 | |
| e | 7 | |
| a | 6 | |
| o | 5 | |
| k | 4 | 6.2% |
| r | 4 | 6.2% |
| m | 4 | 6.2% |
| n | 3 | 4.7% |
| c | 3 | 4.7% |
| Other values (6) | 10 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 446 | |
| . | 134 | 18.7% |
| , | 130 | 18.1% |
| ? | 4 | 0.6% |
| : | 2 | 0.3% |
| # | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 27 | |
| 1 | 3 | 9.1% |
| 7 | 2 | 6.1% |
| 2 | 1 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| 97197 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 264 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 620374 | |
| Common | 98218 | 13.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 76681 | |
| E | 60423 | 9.7% |
| N | 51612 | 8.3% |
| R | 49037 | 7.9% |
| I | 45841 | 7.4% |
| L | 39414 | 6.4% |
| O | 38168 | 6.2% |
| S | 35377 | 5.7% |
| M | 30123 | 4.9% |
| T | 28739 | 4.6% |
| Other values (32) | 164959 |
Common
| Value | Count | Frequency (%) |
| 97197 | ||
| ' | 446 | 0.5% |
| - | 264 | 0.3% |
| . | 134 | 0.1% |
| , | 130 | 0.1% |
| 3 | 27 | < 0.1% |
| ` | 7 | < 0.1% |
| ? | 4 | < 0.1% |
| 1 | 3 | < 0.1% |
| : | 2 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 718592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 97197 | ||
| A | 76681 | 10.7% |
| E | 60423 | 8.4% |
| N | 51612 | 7.2% |
| R | 49037 | 6.8% |
| I | 45841 | 6.4% |
| L | 39414 | 5.5% |
| O | 38168 | 5.3% |
| S | 35377 | 4.9% |
| M | 30123 | 4.2% |
| Other values (45) | 194719 |
| Distinct | 4694 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 14523 |
| Missing (%) | 22.8% |
| Memory size | 496.8 KiB |
| EPSTEIN ENGINEERING, P.C | 904 |
|---|---|
| SUPERSTRUCTURES ENG. & ARCH | 698 |
| RAND ENGINEERING & ARCHITECTURE | 626 |
| MERRITT ENGINEERING CONSULTANTS | 603 |
| LAWLESS & MANGIONE, LLP | 591 |
| Other values (4689) |
Length
| Max length | 43 |
|---|---|
| Median length | 32 |
| Mean length | 23.82915095 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1168939 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2159 ? |
|---|---|
| Unique (%) | 4.4% |
Sample
| 1st row | SUPERSTRUCTURES ENG & ARCH |
|---|---|
| 2nd row | FSI ARCHITECTURE, PC |
| 3rd row | HEITMANN & ASSOCIATES, INC |
| 4th row | MERRITT.ENGINEERING CONSULTANTS |
| 5th row | IBA ARCHITECTS, PLLC |
Common Values
| Value | Count | Frequency (%) |
| EPSTEIN ENGINEERING, P.C | 904 | 1.4% |
| SUPERSTRUCTURES ENG. & ARCH | 698 | 1.1% |
| RAND ENGINEERING & ARCHITECTURE | 626 | 1.0% |
| MERRITT ENGINEERING CONSULTANTS | 603 | 0.9% |
| LAWLESS & MANGIONE, LLP | 591 | 0.9% |
| HLZIMMERMAN ARCHITECTS | 567 | 0.9% |
| GANDHI ENGINEERING INC | 497 | 0.8% |
| DEVON ARCHITECTS | 447 | 0.7% |
| SALAMON ENGINEERING PLLC | 444 | 0.7% |
| RAND ENGINEERING & ARCHITECT | 411 | 0.6% |
| Other values (4684) | 43267 | |
| (Missing) | 14523 | 22.8% |
Length
| Value | Count | Frequency (%) |
| engineering | 14177 | 8.7% |
| 9121 | 5.6% | |
| p.c | 6007 | 3.7% |
| architects | 5766 | 3.5% |
| pc | 5613 | 3.4% |
| architect | 4013 | 2.5% |
| architecture | 3434 | 2.1% |
| inc | 3387 | 2.1% |
| arch | 2901 | 1.8% |
| associates | 2413 | 1.5% |
| Other values (2685) | 106433 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 132080 | |
| 114328 | 9.8% | |
| N | 111860 | 9.6% |
| I | 89283 | 7.6% |
| R | 80811 | 6.9% |
| C | 80176 | 6.9% |
| A | 73568 | 6.3% |
| T | 70169 | 6.0% |
| S | 58933 | 5.0% |
| G | 51908 | 4.4% |
| Other values (64) | 305823 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 999718 | |
| Space Separator | 114328 | 9.8% |
| Other Punctuation | 50552 | 4.3% |
| Lowercase Letter | 3199 | 0.3% |
| Math Symbol | 522 | < 0.1% |
| Dash Punctuation | 427 | < 0.1% |
| Decimal Number | 187 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 132080 | |
| N | 111860 | |
| I | 89283 | |
| R | 80811 | 8.1% |
| C | 80176 | 8.0% |
| A | 73568 | 7.4% |
| T | 70169 | 7.0% |
| S | 58933 | 5.9% |
| G | 51908 | 5.2% |
| L | 41622 | 4.2% |
| Other values (16) | 209308 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 606 | |
| m | 589 | |
| p | 585 | |
| e | 248 | |
| n | 216 | 6.8% |
| i | 199 | 6.2% |
| r | 169 | 5.3% |
| g | 130 | 4.1% |
| t | 109 | 3.4% |
| c | 108 | 3.4% |
| Other values (13) | 240 | 7.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 67 | |
| 2 | 42 | |
| 7 | 19 | 10.2% |
| 4 | 17 | 9.1% |
| 0 | 16 | 8.6% |
| 5 | 7 | 3.7% |
| 3 | 6 | 3.2% |
| 9 | 5 | 2.7% |
| 8 | 4 | 2.1% |
| 6 | 4 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20031 | |
| , | 19010 | |
| & | 10226 | |
| ; | 588 | 1.2% |
| / | 370 | 0.7% |
| ' | 318 | 0.6% |
| % | 6 | < 0.1% |
| @ | 3 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 521 | |
| = | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 114328 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 427 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1002917 | |
| Common | 166022 | 14.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 132080 | |
| N | 111860 | |
| I | 89283 | |
| R | 80811 | 8.1% |
| C | 80176 | 8.0% |
| A | 73568 | 7.3% |
| T | 70169 | 7.0% |
| S | 58933 | 5.9% |
| G | 51908 | 5.2% |
| L | 41622 | 4.2% |
| Other values (39) | 212507 |
Common
| Value | Count | Frequency (%) |
| 114328 | ||
| . | 20031 | 12.1% |
| , | 19010 | 11.5% |
| & | 10226 | 6.2% |
| ; | 588 | 0.4% |
| + | 521 | 0.3% |
| - | 427 | 0.3% |
| / | 370 | 0.2% |
| ' | 318 | 0.2% |
| 1 | 67 | < 0.1% |
| Other values (15) | 136 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1168939 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 132080 | |
| 114328 | 9.8% | |
| N | 111860 | 9.6% |
| I | 89283 | 7.6% |
| R | 80811 | 6.9% |
| C | 80176 | 6.9% |
| A | 73568 | 6.3% |
| T | 70169 | 6.0% |
| S | 58933 | 5.0% |
| G | 51908 | 4.4% |
| Other values (64) | 305823 |
| Distinct | 3901 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 12392 |
| Missing (%) | 19.5% |
| Memory size | 496.8 KiB |
| 480 NORTH BROADWAY | 1848 |
|---|---|
| 159 WEST 25TH STREET | 1023 |
| 317 MADISON AVENUE, SUITE 915 | 812 |
| 11 WEST 30TH STREET | 661 |
| 11 W 30 ST | 631 |
| Other values (3896) |
Length
| Max length | 42 |
|---|---|
| Median length | 38 |
| Mean length | 20.24436369 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1036228 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1695 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | 32 AVENUE OF THE AMERICAS |
|---|---|
| 2nd row | 307 7TH AVENUE, SUITE 1001 |
| 3rd row | 20 WEST 22ND STREET, 17TH FLOOR |
| 4th row | 28-08 BAYSIDE LANE |
| 5th row | 232 MADISON AVENUE |
Common Values
| Value | Count | Frequency (%) |
| 480 NORTH BROADWAY | 1848 | 2.9% |
| 159 WEST 25TH STREET | 1023 | 1.6% |
| 317 MADISON AVENUE, SUITE 915 | 812 | 1.3% |
| 11 WEST 30TH STREET | 661 | 1.0% |
| 11 W 30 ST | 631 | 1.0% |
| 111 JOHN STREET | 611 | 1.0% |
| 152 MADISON AVENUE | 584 | 0.9% |
| 159 WEST 25TH STREET, 12TH FLOOR | 576 | 0.9% |
| 32 AVENUE OF THE AMERICAS | 525 | 0.8% |
| 28-08 BAYSIDE LANE | 516 | 0.8% |
| Other values (3891) | 43399 | |
| (Missing) | 12392 | 19.5% |
Length
| Value | Count | Frequency (%) |
| street | 19685 | 10.1% |
| avenue | 13118 | 6.7% |
| west | 12748 | 6.5% |
| suite | 4972 | 2.5% |
| broadway | 4428 | 2.3% |
| madison | 3637 | 1.9% |
| floor | 3209 | 1.6% |
| ave | 2449 | 1.3% |
| north | 2305 | 1.2% |
| road | 2287 | 1.2% |
| Other values (2288) | 126900 |
Most occurring characters
| Value | Count | Frequency (%) |
| 162407 | ||
| E | 113425 | 10.9% |
| T | 94290 | 9.1% |
| S | 57901 | 5.6% |
| A | 53124 | 5.1% |
| R | 48909 | 4.7% |
| 1 | 41111 | 4.0% |
| N | 35419 | 3.4% |
| O | 32644 | 3.2% |
| 2 | 28965 | 2.8% |
| Other values (59) | 368033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 647409 | |
| Decimal Number | 209188 | 20.2% |
| Space Separator | 162407 | 15.7% |
| Other Punctuation | 12496 | 1.2% |
| Dash Punctuation | 3823 | 0.4% |
| Lowercase Letter | 531 | 0.1% |
| Open Punctuation | 182 | < 0.1% |
| Close Punctuation | 176 | < 0.1% |
| Modifier Symbol | 16 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 113425 | |
| T | 94290 | |
| S | 57901 | |
| A | 53124 | 8.2% |
| R | 48909 | 7.6% |
| N | 35419 | 5.5% |
| O | 32644 | 5.0% |
| H | 27794 | 4.3% |
| U | 24572 | 3.8% |
| I | 23125 | 3.6% |
| Other values (16) | 136206 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 107 | |
| e | 80 | |
| r | 53 | |
| h | 48 | |
| o | 45 | |
| a | 33 | 6.2% |
| s | 26 | 4.9% |
| u | 25 | 4.7% |
| i | 23 | 4.3% |
| n | 21 | 4.0% |
| Other values (10) | 70 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 41111 | |
| 2 | 28965 | |
| 0 | 27444 | |
| 3 | 25191 | |
| 5 | 20793 | |
| 4 | 17541 | |
| 8 | 14108 | 6.7% |
| 9 | 12959 | 6.2% |
| 6 | 11157 | 5.3% |
| 7 | 9919 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 9393 | |
| . | 2143 | 17.1% |
| # | 903 | 7.2% |
| ' | 31 | 0.2% |
| & | 16 | 0.1% |
| / | 6 | < 0.1% |
| ; | 2 | < 0.1% |
| @ | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 162407 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3823 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 182 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 176 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 647940 | |
| Common | 388288 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 113425 | |
| T | 94290 | |
| S | 57901 | |
| A | 53124 | 8.2% |
| R | 48909 | 7.5% |
| N | 35419 | 5.5% |
| O | 32644 | 5.0% |
| H | 27794 | 4.3% |
| U | 24572 | 3.8% |
| I | 23125 | 3.6% |
| Other values (36) | 136737 |
Common
| Value | Count | Frequency (%) |
| 162407 | ||
| 1 | 41111 | 10.6% |
| 2 | 28965 | 7.5% |
| 0 | 27444 | 7.1% |
| 3 | 25191 | 6.5% |
| 5 | 20793 | 5.4% |
| 4 | 17541 | 4.5% |
| 8 | 14108 | 3.6% |
| 9 | 12959 | 3.3% |
| 6 | 11157 | 2.9% |
| Other values (13) | 26612 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1036228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 162407 | ||
| E | 113425 | 10.9% |
| T | 94290 | 9.1% |
| S | 57901 | 5.6% |
| A | 53124 | 5.1% |
| R | 48909 | 4.7% |
| 1 | 41111 | 4.0% |
| N | 35419 | 3.4% |
| O | 32644 | 3.2% |
| 2 | 28965 | 2.8% |
| Other values (59) | 368033 |
| Distinct | 541 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 12887 |
| Missing (%) | 20.3% |
| Memory size | 496.8 KiB |
| NEW YORK | |
|---|---|
| YONKERS | 2173 |
| NY | 1797 |
| BROOKLYN | 1777 |
| BAYSIDE | 1046 |
| Other values (536) |
Length
| Max length | 18 |
|---|---|
| Median length | 8 |
| Mean length | 8.08244067 |
| Min length | 2 |
Characters and Unicode
| Total characters | 409707 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 209 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | NEW YORK |
|---|---|
| 2nd row | NEW YORK |
| 3rd row | NEW YORK |
| 4th row | BAYSIDE |
| 5th row | NEW YORK |
Common Values
| Value | Count | Frequency (%) |
| NEW YORK | 29603 | |
| YONKERS | 2173 | 3.4% |
| NY | 1797 | 2.8% |
| BROOKLYN | 1777 | 2.8% |
| BAYSIDE | 1046 | 1.6% |
| FLUSHING | 694 | 1.1% |
| STATEN ISLAND | 603 | 0.9% |
| NEW ROCHELLE | 546 | 0.9% |
| GREAT NECK | 475 | 0.7% |
| LONG ISLAND CITY | 401 | 0.6% |
| Other values (531) | 11576 | 18.2% |
| (Missing) | 12887 |
Length
| Value | Count | Frequency (%) |
| new | 30502 | |
| york | 29747 | |
| yonkers | 2178 | 2.5% |
| ny | 1805 | 2.1% |
| brooklyn | 1779 | 2.1% |
| island | 1155 | 1.3% |
| bayside | 1047 | 1.2% |
| city | 872 | 1.0% |
| flushing | 694 | 0.8% |
| staten | 606 | 0.7% |
| Other values (511) | 16274 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 45747 | |
| E | 45745 | |
| O | 44636 | |
| R | 40948 | |
| Y | 39788 | |
| 35992 | ||
| K | 35691 | |
| W | 32773 | |
| A | 11444 | 2.8% |
| S | 11443 | 2.8% |
| Other values (46) | 65500 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 372891 | |
| Space Separator | 35992 | 8.8% |
| Other Punctuation | 502 | 0.1% |
| Lowercase Letter | 279 | 0.1% |
| Decimal Number | 42 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 45747 | |
| E | 45745 | |
| O | 44636 | |
| R | 40948 | |
| Y | 39788 | |
| K | 35691 | |
| W | 32773 | |
| A | 11444 | 3.1% |
| S | 11443 | 3.1% |
| L | 10860 | 2.9% |
| Other values (16) | 53816 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 51 | |
| e | 48 | |
| r | 41 | |
| k | 38 | |
| w | 37 | |
| t | 9 | 3.2% |
| l | 9 | 3.2% |
| n | 9 | 3.2% |
| y | 7 | 2.5% |
| s | 6 | 2.2% |
| Other values (10) | 24 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 26 | |
| 5 | 13 | |
| 0 | 1 | 2.4% |
| 3 | 1 | 2.4% |
| 8 | 1 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 481 | |
| , | 20 | 4.0% |
| ' | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 35992 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 373170 | |
| Common | 36537 | 8.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 45747 | |
| E | 45745 | |
| O | 44636 | |
| R | 40948 | |
| Y | 39788 | |
| K | 35691 | |
| W | 32773 | |
| A | 11444 | 3.1% |
| S | 11443 | 3.1% |
| L | 10860 | 2.9% |
| Other values (36) | 54095 |
Common
| Value | Count | Frequency (%) |
| 35992 | ||
| . | 481 | 1.3% |
| 2 | 26 | 0.1% |
| , | 20 | 0.1% |
| 5 | 13 | < 0.1% |
| 0 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| - | 1 | < 0.1% |
| ' | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 409707 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 45747 | |
| E | 45745 | |
| O | 44636 | |
| R | 40948 | |
| Y | 39788 | |
| 35992 | ||
| K | 35691 | |
| W | 32773 | |
| A | 11444 | 2.8% |
| S | 11443 | 2.8% |
| Other values (46) | 65500 |
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12395 |
| Missing (%) | 19.5% |
| Memory size | 496.8 KiB |
| NY | |
|---|---|
| NJ | 3548 |
| CT | 419 |
| N. | 112 |
| VA | 41 |
| Other values (11) | 99 |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.000195377 |
| Min length | 2 |
Characters and Unicode
| Total characters | 102376 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NY |
|---|---|
| 2nd row | NY |
| 3rd row | NY |
| 4th row | NY |
| 5th row | NY |
Common Values
| Value | Count | Frequency (%) |
| NY | 46964 | |
| NJ | 3548 | 5.6% |
| CT | 419 | 0.7% |
| N. | 112 | 0.2% |
| VA | 41 | 0.1% |
| MD | 25 | < 0.1% |
| FL | 24 | < 0.1% |
| PA | 18 | < 0.1% |
| IL | 14 | < 0.1% |
| NE | 6 | < 0.1% |
| Other values (6) | 12 | < 0.1% |
| (Missing) | 12395 | 19.5% |
Length
| Value | Count | Frequency (%) |
| ny | 46964 | |
| nj | 3548 | 6.9% |
| ct | 419 | 0.8% |
| n | 112 | 0.2% |
| va | 41 | 0.1% |
| md | 25 | < 0.1% |
| fl | 24 | < 0.1% |
| pa | 18 | < 0.1% |
| il | 14 | < 0.1% |
| ne | 6 | < 0.1% |
| Other values (6) | 12 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 50635 | |
| Y | 46964 | |
| J | 3548 | 3.5% |
| C | 422 | 0.4% |
| T | 420 | 0.4% |
| . | 112 | 0.1% |
| A | 60 | 0.1% |
| V | 41 | < 0.1% |
| L | 38 | < 0.1% |
| D | 32 | < 0.1% |
| Other values (14) | 104 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 102252 | |
| Other Punctuation | 112 | 0.1% |
| Lowercase Letter | 12 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 50635 | |
| Y | 46964 | |
| J | 3548 | 3.5% |
| C | 422 | 0.4% |
| T | 420 | 0.4% |
| A | 60 | 0.1% |
| V | 41 | < 0.1% |
| L | 38 | < 0.1% |
| D | 32 | < 0.1% |
| F | 26 | < 0.1% |
| Other values (7) | 66 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| o | 2 | |
| r | 2 | |
| i | 2 | |
| d | 2 | |
| a | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 112 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 102264 | |
| Common | 112 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 50635 | |
| Y | 46964 | |
| J | 3548 | 3.5% |
| C | 422 | 0.4% |
| T | 420 | 0.4% |
| A | 60 | 0.1% |
| V | 41 | < 0.1% |
| L | 38 | < 0.1% |
| D | 32 | < 0.1% |
| F | 26 | < 0.1% |
| Other values (13) | 78 | 0.1% |
Common
| Value | Count | Frequency (%) |
| . | 112 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 102376 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 50635 | |
| Y | 46964 | |
| J | 3548 | 3.5% |
| C | 422 | 0.4% |
| T | 420 | 0.4% |
| . | 112 | 0.1% |
| A | 60 | 0.1% |
| V | 41 | < 0.1% |
| L | 38 | < 0.1% |
| D | 32 | < 0.1% |
| Other values (14) | 104 | 0.1% |
| Distinct | 233 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 44195 |
| Missing (%) | 69.5% |
| Memory size | 496.8 KiB |
| 10001 | |
|---|---|
| 10018 | |
| 10016 | 1032 |
| 10013 | 795 |
| 10701 | 734 |
| Other values (228) |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.026466491 |
| Min length | 2 |
Characters and Unicode
| Total characters | 97428 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 26 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | NY |
|---|---|
| 2nd row | 07666 |
| 3rd row | NY |
| 4th row | 10013 |
| 5th row | 11563 |
Common Values
| Value | Count | Frequency (%) |
| 10001 | 3644 | 5.7% |
| 10018 | 2526 | 4.0% |
| 10016 | 1032 | 1.6% |
| 10013 | 795 | 1.3% |
| 10701 | 734 | 1.2% |
| 10011 | 613 | 1.0% |
| 10010 | 590 | 0.9% |
| 11358 | 463 | 0.7% |
| 10025 | 403 | 0.6% |
| 10017 | 391 | 0.6% |
| Other values (223) | 8192 | 12.9% |
| (Missing) | 44195 |
Length
| Value | Count | Frequency (%) |
| 10001 | 3644 | |
| 10018 | 2526 | 13.0% |
| 10016 | 1032 | 5.3% |
| 10013 | 795 | 4.1% |
| 10701 | 734 | 3.8% |
| 10011 | 613 | 3.2% |
| 10010 | 590 | 3.0% |
| 11358 | 463 | 2.4% |
| 10025 | 403 | 2.1% |
| 10017 | 391 | 2.0% |
| Other values (223) | 8192 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 36594 | |
| 1 | 34692 | |
| 7 | 4801 | 4.9% |
| 8 | 4515 | 4.6% |
| 3 | 4390 | 4.5% |
| 2 | 3797 | 3.9% |
| 5 | 3416 | 3.5% |
| 6 | 3085 | 3.2% |
| 4 | 1108 | 1.1% |
| 9 | 940 | 1.0% |
| Other values (3) | 90 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 97338 | |
| Dash Punctuation | 86 | 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 36594 | |
| 1 | 34692 | |
| 7 | 4801 | 4.9% |
| 8 | 4515 | 4.6% |
| 3 | 4390 | 4.5% |
| 2 | 3797 | 3.9% |
| 5 | 3416 | 3.5% |
| 6 | 3085 | 3.2% |
| 4 | 1108 | 1.1% |
| 9 | 940 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 | |
| Y | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 97424 | |
| Latin | 4 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 36594 | |
| 1 | 34692 | |
| 7 | 4801 | 4.9% |
| 8 | 4515 | 4.6% |
| 3 | 4390 | 4.5% |
| 2 | 3797 | 3.9% |
| 5 | 3416 | 3.5% |
| 6 | 3085 | 3.2% |
| 4 | 1108 | 1.1% |
| 9 | 940 | 1.0% |
Latin
| Value | Count | Frequency (%) |
| N | 2 | |
| Y | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97428 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 36594 | |
| 1 | 34692 | |
| 7 | 4801 | 4.9% |
| 8 | 4515 | 4.6% |
| 3 | 4390 | 4.5% |
| 2 | 3797 | 3.9% |
| 5 | 3416 | 3.5% |
| 6 | 3085 | 3.2% |
| 4 | 1108 | 1.1% |
| 9 | 940 | 1.0% |
| Other values (3) | 90 | 0.1% |
| Distinct | 471 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 44173 |
| Missing (%) | 69.5% |
| Memory size | 496.8 KiB |
| RA - 014327 | 632 |
|---|---|
| PE - 088730 | 419 |
| PE - 088575 | 393 |
| RA - 031877 | 366 |
| PE - 058384 | 344 |
| Other values (466) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.9996908 |
| Min length | 10 |
Characters and Unicode
| Total characters | 213449 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 63 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | PE - 079953 |
|---|---|
| 2nd row | PE - 079953 |
| 3rd row | PE - 079953 |
| 4th row | L - 611208 |
| 5th row | RA - 039518 |
Common Values
| Value | Count | Frequency (%) |
| RA - 014327 | 632 | 1.0% |
| PE - 088730 | 419 | 0.7% |
| PE - 088575 | 393 | 0.6% |
| RA - 031877 | 366 | 0.6% |
| PE - 058384 | 344 | 0.5% |
| PE - 067415 | 341 | 0.5% |
| PE - 048838 | 338 | 0.5% |
| RA - 017183 | 325 | 0.5% |
| PE - 088950 | 321 | 0.5% |
| PE - 084457 | 309 | 0.5% |
| Other values (461) | 15617 | 24.6% |
| (Missing) | 44173 |
Length
| Value | Count | Frequency (%) |
| 19405 | ||
| pe | 10747 | |
| ra | 8652 | |
| 014327 | 632 | 1.1% |
| 088730 | 419 | 0.7% |
| 088575 | 393 | 0.7% |
| 031877 | 366 | 0.6% |
| 058384 | 344 | 0.6% |
| 067415 | 341 | 0.6% |
| 048838 | 338 | 0.6% |
| Other values (466) | 16578 |
Most occurring characters
| Value | Count | Frequency (%) |
| 38810 | ||
| 0 | 27442 | |
| - | 19405 | 9.1% |
| 8 | 12375 | 5.8% |
| 7 | 11253 | 5.3% |
| 3 | 10765 | 5.0% |
| P | 10747 | 5.0% |
| E | 10747 | 5.0% |
| 1 | 10034 | 4.7% |
| 4 | 10005 | 4.7% |
| Other values (8) | 51866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 116430 | |
| Space Separator | 38810 | 18.2% |
| Uppercase Letter | 38804 | 18.2% |
| Dash Punctuation | 19405 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 27442 | |
| 8 | 12375 | |
| 7 | 11253 | |
| 3 | 10765 | 9.2% |
| 1 | 10034 | 8.6% |
| 4 | 10005 | 8.6% |
| 2 | 9480 | 8.1% |
| 5 | 8683 | 7.5% |
| 9 | 8599 | 7.4% |
| 6 | 7794 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 10747 | |
| E | 10747 | |
| R | 8653 | |
| A | 8652 | |
| X | 4 | < 0.1% |
| L | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 38810 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 174645 | |
| Latin | 38804 | 18.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 38810 | ||
| 0 | 27442 | |
| - | 19405 | |
| 8 | 12375 | 7.1% |
| 7 | 11253 | 6.4% |
| 3 | 10765 | 6.2% |
| 1 | 10034 | 5.7% |
| 4 | 10005 | 5.7% |
| 2 | 9480 | 5.4% |
| 5 | 8683 | 5.0% |
| Other values (2) | 16393 |
Latin
| Value | Count | Frequency (%) |
| P | 10747 | |
| E | 10747 | |
| R | 8653 | |
| A | 8652 | |
| X | 4 | < 0.1% |
| L | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 213449 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 38810 | ||
| 0 | 27442 | |
| - | 19405 | 9.1% |
| 8 | 12375 | 5.8% |
| 7 | 11253 | 5.3% |
| 3 | 10765 | 5.0% |
| P | 10747 | 5.0% |
| E | 10747 | 5.0% |
| 1 | 10034 | 4.7% |
| 4 | 10005 | 4.7% |
| Other values (8) | 51866 |
| Distinct | 6058 |
|---|---|
| Distinct (%) | 31.2% |
| Missing | 44158 |
| Missing (%) | 69.5% |
| Memory size | 496.8 KiB |
| LLOYD VALDEZ | 734 |
|---|---|
| GARY GUILLAUME | 539 |
| RICHARD MORRISON | 209 |
| MARTHA BRAZOBAN | 202 |
| EDWARD MCARTHUR | 172 |
| Other values (6053) |
Length
| Max length | 51 |
|---|---|
| Median length | 39 |
| Mean length | 14.58470649 |
| Min length | 6 |
Characters and Unicode
| Total characters | 283235 |
|---|---|
| Distinct characters | 70 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3271 ? |
|---|---|
| Unique (%) | 16.8% |
Sample
| 1st row | GOLDSTEIN STUART |
|---|---|
| 2nd row | JEFF FARKAS |
| 3rd row | ETEM BIZATI |
| 4th row | NUTTUALL ELEANOR |
| 5th row | GAZIVODA ANTHONY |
Common Values
| Value | Count | Frequency (%) |
| LLOYD VALDEZ | 734 | 1.2% |
| GARY GUILLAUME | 539 | 0.8% |
| RICHARD MORRISON | 209 | 0.3% |
| MARTHA BRAZOBAN | 202 | 0.3% |
| EDWARD MCARTHUR | 172 | 0.3% |
| MICHAEL WOLFE | 113 | 0.2% |
| MARY FRANCES SHAUGHNESSY | 108 | 0.2% |
| PHILLIP WISCHERTH | 100 | 0.2% |
| JUAN R. TORRES | 97 | 0.2% |
| FIRSTSERVICE RESIDENTIAL | 83 | 0.1% |
| Other values (6048) | 17063 | 26.8% |
| (Missing) | 44158 |
Length
| Value | Count | Frequency (%) |
| lloyd | 781 | 2.0% |
| valdez | 736 | 1.8% |
| michael | 682 | 1.7% |
| gary | 576 | 1.4% |
| guillaume | 542 | 1.4% |
| richard | 486 | 1.2% |
| david | 465 | 1.2% |
| john | 404 | 1.0% |
| joseph | 294 | 0.7% |
| robert | 288 | 0.7% |
| Other values (6205) | 34590 |
Most occurring characters
| Value | Count | Frequency (%) |
| 39569 | ||
| A | 28804 | 10.2% |
| E | 24719 | 8.7% |
| R | 20756 | 7.3% |
| N | 17780 | 6.3% |
| I | 16515 | 5.8% |
| L | 15745 | 5.6% |
| O | 14796 | 5.2% |
| S | 12930 | 4.6% |
| T | 9780 | 3.5% |
| Other values (60) | 81841 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 242283 | |
| Space Separator | 39569 | 14.0% |
| Decimal Number | 624 | 0.2% |
| Other Punctuation | 407 | 0.1% |
| Lowercase Letter | 270 | 0.1% |
| Dash Punctuation | 75 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 28804 | |
| E | 24719 | 10.2% |
| R | 20756 | 8.6% |
| N | 17780 | 7.3% |
| I | 16515 | 6.8% |
| L | 15745 | 6.5% |
| O | 14796 | 6.1% |
| S | 12930 | 5.3% |
| T | 9780 | 4.0% |
| M | 9548 | 3.9% |
| Other values (16) | 70910 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 47 | |
| r | 36 | |
| k | 30 | |
| e | 29 | |
| w | 20 | |
| n | 18 | 6.7% |
| a | 17 | 6.3% |
| l | 13 | 4.8% |
| y | 13 | 4.8% |
| t | 7 | 2.6% |
| Other values (12) | 40 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 81 | |
| 2 | 81 | |
| 0 | 69 | |
| 4 | 63 | |
| 6 | 60 | |
| 8 | 58 | |
| 7 | 55 | |
| 3 | 54 | |
| 9 | 52 | |
| 5 | 51 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 300 | |
| ' | 53 | 13.0% |
| , | 21 | 5.2% |
| / | 16 | 3.9% |
| ? | 11 | 2.7% |
| & | 3 | 0.7% |
| ; | 3 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 39569 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 75 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 242553 | |
| Common | 40682 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 28804 | |
| E | 24719 | 10.2% |
| R | 20756 | 8.6% |
| N | 17780 | 7.3% |
| I | 16515 | 6.8% |
| L | 15745 | 6.5% |
| O | 14796 | 6.1% |
| S | 12930 | 5.3% |
| T | 9780 | 4.0% |
| M | 9548 | 3.9% |
| Other values (38) | 71180 |
Common
| Value | Count | Frequency (%) |
| 39569 | ||
| . | 300 | 0.7% |
| 1 | 81 | 0.2% |
| 2 | 81 | 0.2% |
| - | 75 | 0.2% |
| 0 | 69 | 0.2% |
| 4 | 63 | 0.2% |
| 6 | 60 | 0.1% |
| 8 | 58 | 0.1% |
| 7 | 55 | 0.1% |
| Other values (12) | 271 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 283235 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 39569 | ||
| A | 28804 | 10.2% |
| E | 24719 | 8.7% |
| R | 20756 | 7.3% |
| N | 17780 | 6.3% |
| I | 16515 | 5.8% |
| L | 15745 | 5.6% |
| O | 14796 | 5.2% |
| S | 12930 | 4.6% |
| T | 9780 | 3.5% |
| Other values (60) | 81841 |
| Distinct | 26232 |
|---|---|
| Distinct (%) | 50.7% |
| Missing | 11812 |
| Missing (%) | 18.6% |
| Memory size | 496.8 KiB |
| N.Y.C.H.A. | 2262 |
|---|---|
| NEW YORK CITY HOUSING AUTHORITY | 2119 |
| NYCHA | 1600 |
| PR | 1160 |
| NYC HOUSING AUTHORITY | 712 |
| Other values (26227) |
Length
| Max length | 100 |
|---|---|
| Median length | 80 |
| Mean length | 21.17523085 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1096157 |
|---|---|
| Distinct characters | 85 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 18083 ? |
|---|---|
| Unique (%) | 34.9% |
Sample
| 1st row | BROOKFIELD PROPERTIES |
|---|---|
| 2nd row | 62 COOPER SQUARE CONDOMINIUM |
| 3rd row | ONE STATE STREET, LLC |
| 4th row | VERIZON NEW YORK, INC |
| 5th row | CUSHMAN & WAKEFIELD INC |
Common Values
| Value | Count | Frequency (%) |
| N.Y.C.H.A. | 2262 | 3.6% |
| NEW YORK CITY HOUSING AUTHORITY | 2119 | 3.3% |
| NYCHA | 1600 | 2.5% |
| PR | 1160 | 1.8% |
| NYC HOUSING AUTHORITY | 712 | 1.1% |
| COLUMBIA UNIVERSITY | 296 | 0.5% |
| NEW YORK UNIVERSITY | 188 | 0.3% |
| N.Y.C.H.A | 125 | 0.2% |
| PARKCHESTER NORTH CONDOMINIUM | 103 | 0.2% |
| PARKCHESTER SOUTH CONDOMINIUM, INC. | 97 | 0.2% |
| Other values (26222) | 43104 | |
| (Missing) | 11812 | 18.6% |
Length
| Value | Count | Frequency (%) |
| llc | 9196 | 5.2% |
| corp | 8192 | 4.6% |
| inc | 4276 | 2.4% |
| owners | 3820 | 2.2% |
| housing | 3819 | 2.2% |
| realty | 3798 | 2.1% |
| street | 3452 | 1.9% |
| new | 2937 | 1.7% |
| authority | 2900 | 1.6% |
| york | 2806 | 1.6% |
| Other values (11022) | 132324 |
Most occurring characters
| Value | Count | Frequency (%) |
| 125931 | 11.5% | |
| E | 73560 | 6.7% |
| O | 60938 | 5.6% |
| R | 59903 | 5.5% |
| T | 59813 | 5.5% |
| A | 59217 | 5.4% |
| N | 58097 | 5.3% |
| C | 54491 | 5.0% |
| S | 49473 | 4.5% |
| I | 44528 | 4.1% |
| Other values (75) | 450206 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 761546 | |
| Space Separator | 125931 | 11.5% |
| Lowercase Letter | 118681 | 10.8% |
| Decimal Number | 57518 | 5.2% |
| Other Punctuation | 30122 | 2.7% |
| Dash Punctuation | 2197 | 0.2% |
| Open Punctuation | 60 | < 0.1% |
| Close Punctuation | 55 | < 0.1% |
| Math Symbol | 33 | < 0.1% |
| Other Number | 11 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15576 | |
| o | 11185 | |
| n | 11146 | |
| t | 10992 | |
| r | 10672 | |
| a | 9436 | 8.0% |
| s | 7868 | 6.6% |
| i | 7708 | 6.5% |
| l | 4209 | 3.5% |
| m | 4070 | 3.4% |
| Other values (17) | 25819 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 73560 | 9.7% |
| O | 60938 | 8.0% |
| R | 59903 | 7.9% |
| T | 59813 | 7.9% |
| A | 59217 | 7.8% |
| N | 58097 | 7.6% |
| C | 54491 | 7.2% |
| S | 49473 | 6.5% |
| I | 44528 | 5.8% |
| L | 43456 | 5.7% |
| Other values (16) | 198070 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 21375 | |
| , | 5619 | 18.7% |
| / | 1410 | 4.7% |
| & | 1021 | 3.4% |
| ' | 404 | 1.3% |
| ; | 135 | 0.4% |
| # | 114 | 0.4% |
| @ | 19 | 0.1% |
| ¿ | 11 | < 0.1% |
| % | 7 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10312 | |
| 2 | 7276 | |
| 0 | 6892 | |
| 5 | 6802 | |
| 3 | 5923 | |
| 4 | 4928 | |
| 7 | 4363 | |
| 6 | 4088 | 7.1% |
| 8 | 3671 | 6.4% |
| 9 | 3263 | 5.7% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 32 | |
| < | 1 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| 125931 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2197 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 60 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 55 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 11 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 880227 | |
| Common | 215930 | 19.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 73560 | 8.4% |
| O | 60938 | 6.9% |
| R | 59903 | 6.8% |
| T | 59813 | 6.8% |
| A | 59217 | 6.7% |
| N | 58097 | 6.6% |
| C | 54491 | 6.2% |
| S | 49473 | 5.6% |
| I | 44528 | 5.1% |
| L | 43456 | 4.9% |
| Other values (43) | 316751 |
Common
| Value | Count | Frequency (%) |
| 125931 | ||
| . | 21375 | 9.9% |
| 1 | 10312 | 4.8% |
| 2 | 7276 | 3.4% |
| 0 | 6892 | 3.2% |
| 5 | 6802 | 3.2% |
| 3 | 5923 | 2.7% |
| , | 5619 | 2.6% |
| 4 | 4928 | 2.3% |
| 7 | 4363 | 2.0% |
| Other values (22) | 16509 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1096124 | |
| None | 33 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 125931 | 11.5% | |
| E | 73560 | 6.7% |
| O | 60938 | 5.6% |
| R | 59903 | 5.5% |
| T | 59813 | 5.5% |
| A | 59217 | 5.4% |
| N | 58097 | 5.3% |
| C | 54491 | 5.0% |
| S | 49473 | 4.5% |
| I | 44528 | 4.1% |
| Other values (72) | 450173 |
None
| Value | Count | Frequency (%) |
| ï | 11 | |
| ¿ | 11 | |
| ½ | 11 |
| Distinct | 4464 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 12782 |
| Missing (%) | 20.1% |
| Memory size | 496.8 KiB |
| 02/21/2007 12:00:00 AM | 1215 |
|---|---|
| 02/20/2007 12:00:00 AM | 733 |
| 02/21/2012 12:00:00 AM | 642 |
| 02/21/2022 12:00:00 AM | 501 |
| 02/21/2017 12:00:00 AM | 464 |
| Other values (4459) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 1117512 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 440 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 02/21/2012 12:00:00 AM |
|---|---|
| 2nd row | 11/07/2012 12:00:00 AM |
| 3rd row | 03/26/2012 12:00:00 AM |
| 4th row | 08/20/2012 12:00:00 AM |
| 5th row | 11/10/2011 12:00:00 AM |
Common Values
| Value | Count | Frequency (%) |
| 02/21/2007 12:00:00 AM | 1215 | 1.9% |
| 02/20/2007 12:00:00 AM | 733 | 1.2% |
| 02/21/2012 12:00:00 AM | 642 | 1.0% |
| 02/21/2022 12:00:00 AM | 501 | 0.8% |
| 02/21/2017 12:00:00 AM | 464 | 0.7% |
| 02/18/2022 12:00:00 AM | 445 | 0.7% |
| 02/16/2007 12:00:00 AM | 408 | 0.6% |
| 08/21/2012 12:00:00 AM | 351 | 0.6% |
| 02/21/2013 12:00:00 AM | 350 | 0.6% |
| 08/20/2012 12:00:00 AM | 347 | 0.5% |
| Other values (4454) | 45340 | |
| (Missing) | 12782 | 20.1% |
Length
| Value | Count | Frequency (%) |
| am | 50789 | |
| 12:00:00 | 50626 | |
| 02/21/2007 | 1215 | 0.8% |
| 02/20/2007 | 733 | 0.5% |
| 02/21/2012 | 642 | 0.4% |
| 02/21/2022 | 501 | 0.3% |
| 02/21/2017 | 464 | 0.3% |
| 02/18/2022 | 445 | 0.3% |
| 02/16/2007 | 408 | 0.3% |
| 08/21/2012 | 351 | 0.2% |
| Other values (4454) | 46214 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 332244 | |
| 2 | 161645 | |
| 1 | 128676 | 11.5% |
| / | 101592 | 9.1% |
| 101592 | 9.1% | |
| : | 101592 | 9.1% |
| M | 50796 | 4.5% |
| A | 50789 | 4.5% |
| 7 | 18081 | 1.6% |
| 8 | 14047 | 1.3% |
| Other values (6) | 56458 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 711144 | |
| Other Punctuation | 203184 | 18.2% |
| Space Separator | 101592 | 9.1% |
| Uppercase Letter | 101592 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 332244 | |
| 2 | 161645 | |
| 1 | 128676 | 18.1% |
| 7 | 18081 | 2.5% |
| 8 | 14047 | 2.0% |
| 3 | 13383 | 1.9% |
| 6 | 12495 | 1.8% |
| 9 | 12238 | 1.7% |
| 5 | 10054 | 1.4% |
| 4 | 8281 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 50796 | |
| A | 50789 | |
| P | 7 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 101592 | |
| : | 101592 |
Space Separator
| Value | Count | Frequency (%) |
| 101592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1015920 | |
| Latin | 101592 | 9.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 332244 | |
| 2 | 161645 | |
| 1 | 128676 | 12.7% |
| / | 101592 | 10.0% |
| 101592 | 10.0% | |
| : | 101592 | 10.0% |
| 7 | 18081 | 1.8% |
| 8 | 14047 | 1.4% |
| 3 | 13383 | 1.3% |
| 6 | 12495 | 1.2% |
| Other values (3) | 30573 | 3.0% |
Latin
| Value | Count | Frequency (%) |
| M | 50796 | |
| A | 50789 | |
| P | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1117512 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 332244 | |
| 2 | 161645 | |
| 1 | 128676 | 11.5% |
| / | 101592 | 9.1% |
| 101592 | 9.1% | |
| : | 101592 | 9.1% |
| M | 50796 | 4.5% |
| A | 50789 | 4.5% |
| 7 | 18081 | 1.6% |
| 8 | 14047 | 1.3% |
| Other values (6) | 56458 | 5.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 496.8 KiB |
| SAFE | |
|---|---|
| SWARMP | |
| No Report Filed | |
| UNSAFE |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 7.1382711 |
| Min length | 4 |
Characters and Unicode
| Total characters | 453837 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Report Filed |
|---|---|
| 2nd row | No Report Filed |
| 3rd row | No Report Filed |
| 4th row | No Report Filed |
| 5th row | No Report Filed |
Common Values
| Value | Count | Frequency (%) |
| SAFE | 21321 | |
| SWARMP | 19163 | |
| No Report Filed | 12779 | |
| UNSAFE | 10315 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| safe | 21321 | |
| swarmp | 19163 | |
| no | 12779 | |
| report | 12779 | |
| filed | 12779 | |
| unsafe | 10315 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 50799 | |
| A | 50799 | |
| F | 44415 | 9.8% |
| R | 31942 | 7.0% |
| E | 31636 | 7.0% |
| e | 25558 | 5.6% |
| 25558 | 5.6% | |
| o | 25558 | 5.6% |
| N | 23094 | 5.1% |
| P | 19163 | 4.2% |
| Other values (9) | 125315 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 300489 | |
| Lowercase Letter | 127790 | |
| Space Separator | 25558 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 50799 | |
| A | 50799 | |
| F | 44415 | |
| R | 31942 | |
| E | 31636 | |
| N | 23094 | |
| P | 19163 | 6.4% |
| M | 19163 | 6.4% |
| W | 19163 | 6.4% |
| U | 10315 | 3.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 25558 | |
| o | 25558 | |
| p | 12779 | |
| r | 12779 | |
| t | 12779 | |
| i | 12779 | |
| l | 12779 | |
| d | 12779 |
Space Separator
| Value | Count | Frequency (%) |
| 25558 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 428279 | |
| Common | 25558 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 50799 | |
| A | 50799 | |
| F | 44415 | |
| R | 31942 | 7.5% |
| E | 31636 | 7.4% |
| e | 25558 | 6.0% |
| o | 25558 | 6.0% |
| N | 23094 | 5.4% |
| P | 19163 | 4.5% |
| M | 19163 | 4.5% |
| Other values (8) | 106152 |
Common
| Value | Count | Frequency (%) |
| 25558 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 453837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 50799 | |
| A | 50799 | |
| F | 44415 | 9.8% |
| R | 31942 | 7.0% |
| E | 31636 | 7.0% |
| e | 25558 | 5.6% |
| 25558 | 5.6% | |
| o | 25558 | 5.6% |
| N | 23094 | 5.1% |
| P | 19163 | 4.2% |
| Other values (9) | 125315 |
| Distinct | 5274 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 20508 |
| Missing (%) | 32.3% |
| Memory size | 496.8 KiB |
| 02/21/2007 12:00:00 AM | 1288 |
|---|---|
| 02/21/2002 12:00:00 AM | 982 |
| 02/20/2007 12:00:00 AM | 768 |
| 03/01/2000 12:00:00 AM | 664 |
| 02/29/2000 12:00:00 AM | 551 |
| Other values (5269) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 947540 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1250 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | 12/08/2006 12:00:00 AM |
|---|---|
| 2nd row | 02/20/2007 12:00:00 AM |
| 3rd row | 10/22/2007 12:00:00 AM |
| 4th row | 01/17/2007 12:00:00 AM |
| 5th row | 11/09/2006 12:00:00 AM |
Common Values
| Value | Count | Frequency (%) |
| 02/21/2007 12:00:00 AM | 1288 | 2.0% |
| 02/21/2002 12:00:00 AM | 982 | 1.5% |
| 02/20/2007 12:00:00 AM | 768 | 1.2% |
| 03/01/2000 12:00:00 AM | 664 | 1.0% |
| 02/29/2000 12:00:00 AM | 551 | 0.9% |
| 02/21/2012 12:00:00 AM | 451 | 0.7% |
| 02/16/2007 12:00:00 AM | 420 | 0.7% |
| 02/21/2017 12:00:00 AM | 389 | 0.6% |
| 02/28/2000 12:00:00 AM | 332 | 0.5% |
| 02/15/2007 12:00:00 AM | 317 | 0.5% |
| Other values (5264) | 36908 | |
| (Missing) | 20508 |
Length
| Value | Count | Frequency (%) |
| am | 43064 | |
| 12:00:00 | 41879 | |
| 02/21/2007 | 1288 | 1.0% |
| 01:00:00 | 1185 | 0.9% |
| 02/21/2002 | 982 | 0.8% |
| 02/20/2007 | 768 | 0.6% |
| 03/01/2000 | 664 | 0.5% |
| 02/29/2000 | 551 | 0.4% |
| 02/21/2012 | 451 | 0.3% |
| 02/16/2007 | 420 | 0.3% |
| Other values (5267) | 37958 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 297624 | |
| 2 | 131018 | |
| 1 | 99853 | 10.5% |
| / | 86140 | 9.1% |
| 86140 | 9.1% | |
| : | 86140 | 9.1% |
| M | 43070 | 4.5% |
| A | 43064 | 4.5% |
| 7 | 15196 | 1.6% |
| 3 | 14153 | 1.5% |
| Other values (6) | 45142 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 602980 | |
| Other Punctuation | 172280 | 18.2% |
| Space Separator | 86140 | 9.1% |
| Uppercase Letter | 86140 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 297624 | |
| 2 | 131018 | |
| 1 | 99853 | 16.6% |
| 7 | 15196 | 2.5% |
| 3 | 14153 | 2.3% |
| 9 | 11073 | 1.8% |
| 8 | 9686 | 1.6% |
| 6 | 9192 | 1.5% |
| 4 | 7801 | 1.3% |
| 5 | 7384 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 43070 | |
| A | 43064 | |
| P | 6 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 86140 | |
| : | 86140 |
Space Separator
| Value | Count | Frequency (%) |
| 86140 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 861400 | |
| Latin | 86140 | 9.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 297624 | |
| 2 | 131018 | |
| 1 | 99853 | 11.6% |
| / | 86140 | 10.0% |
| 86140 | 10.0% | |
| : | 86140 | 10.0% |
| 7 | 15196 | 1.8% |
| 3 | 14153 | 1.6% |
| 9 | 11073 | 1.3% |
| 8 | 9686 | 1.1% |
| Other values (3) | 24377 | 2.8% |
Latin
| Value | Count | Frequency (%) |
| M | 43070 | |
| A | 43064 | |
| P | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 947540 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 297624 | |
| 2 | 131018 | |
| 1 | 99853 | 10.5% |
| / | 86140 | 9.1% |
| 86140 | 9.1% | |
| : | 86140 | 9.1% |
| M | 43070 | 4.5% |
| A | 43064 | 4.5% |
| 7 | 15196 | 1.6% |
| 3 | 14153 | 1.5% |
| Other values (6) | 45142 | 4.8% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17851 |
| Missing (%) | 28.1% |
| Memory size | 496.8 KiB |
| SAFE | |
|---|---|
| SWARMP | |
| UNSAFE | |
| No Report Filed |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 5.724429768 |
| Min length | 4 |
Characters and Unicode
| Total characters | 261761 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SWARMP |
|---|---|
| 2nd row | SWARMP |
| 3rd row | SWARMP |
| 4th row | SAFE |
| 5th row | SAFE |
Common Values
| Value | Count | Frequency (%) |
| SAFE | 19148 | |
| SWARMP | 18778 | |
| UNSAFE | 4946 | 7.8% |
| No Report Filed | 2855 | 4.5% |
| (Missing) | 17851 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| safe | 19148 | |
| swarmp | 18778 | |
| unsafe | 4946 | 9.6% |
| no | 2855 | 5.6% |
| report | 2855 | 5.6% |
| filed | 2855 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 42872 | |
| A | 42872 | |
| F | 26949 | |
| E | 24094 | |
| R | 21633 | |
| W | 18778 | |
| M | 18778 | |
| P | 18778 | |
| N | 7801 | 3.0% |
| o | 5710 | 2.2% |
| Other values (9) | 33496 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 227501 | |
| Lowercase Letter | 28550 | 10.9% |
| Space Separator | 5710 | 2.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 42872 | |
| A | 42872 | |
| F | 26949 | |
| E | 24094 | |
| R | 21633 | |
| W | 18778 | |
| M | 18778 | |
| P | 18778 | |
| N | 7801 | 3.4% |
| U | 4946 | 2.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5710 | |
| e | 5710 | |
| p | 2855 | |
| r | 2855 | |
| t | 2855 | |
| i | 2855 | |
| l | 2855 | |
| d | 2855 |
Space Separator
| Value | Count | Frequency (%) |
| 5710 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 256051 | |
| Common | 5710 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 42872 | |
| A | 42872 | |
| F | 26949 | |
| E | 24094 | |
| R | 21633 | |
| W | 18778 | |
| M | 18778 | |
| P | 18778 | |
| N | 7801 | 3.0% |
| o | 5710 | 2.2% |
| Other values (8) | 27786 |
Common
| Value | Count | Frequency (%) |
| 5710 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 261761 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 42872 | |
| A | 42872 | |
| F | 26949 | |
| E | 24094 | |
| R | 21633 | |
| W | 18778 | |
| M | 18778 | |
| P | 18778 | |
| N | 7801 | 3.0% |
| o | 5710 | 2.2% |
| Other values (9) | 33496 |
| Distinct | 5145 |
|---|---|
| Distinct (%) | 11.0% |
| Missing | 16664 |
| Missing (%) | 26.2% |
| Memory size | 496.8 KiB |
| 02/11/2022 12:00:00 AM | 143 |
|---|---|
| 02/16/2022 12:00:00 AM | 131 |
| 02/15/2022 12:00:00 AM | 131 |
| 02/18/2022 12:00:00 AM | 125 |
| 02/09/2022 12:00:00 AM | 125 |
| Other values (5140) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 1032108 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 773 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | 02/10/2012 12:00:00 AM |
|---|---|
| 2nd row | 10/04/2012 12:00:00 AM |
| 3rd row | 01/23/2012 12:00:00 AM |
| 4th row | 07/25/2012 12:00:00 AM |
| 5th row | 10/19/2011 12:00:00 AM |
Common Values
| Value | Count | Frequency (%) |
| 02/11/2022 12:00:00 AM | 143 | 0.2% |
| 02/16/2022 12:00:00 AM | 131 | 0.2% |
| 02/15/2022 12:00:00 AM | 131 | 0.2% |
| 02/18/2022 12:00:00 AM | 125 | 0.2% |
| 02/09/2022 12:00:00 AM | 125 | 0.2% |
| 11/01/2006 01:00:00 AM | 125 | 0.2% |
| 02/15/2012 12:00:00 AM | 120 | 0.2% |
| 02/08/2022 12:00:00 AM | 118 | 0.2% |
| 12/01/2006 12:00:00 AM | 117 | 0.2% |
| 02/10/2022 12:00:00 AM | 113 | 0.2% |
| Other values (5135) | 45666 | |
| (Missing) | 16664 | 26.2% |
Length
| Value | Count | Frequency (%) |
| am | 46905 | |
| 12:00:00 | 46302 | |
| 01:00:00 | 603 | 0.4% |
| 02/11/2022 | 143 | 0.1% |
| 02/16/2022 | 131 | 0.1% |
| 02/15/2022 | 131 | 0.1% |
| 02/18/2022 | 125 | 0.1% |
| 02/09/2022 | 125 | 0.1% |
| 11/01/2006 | 125 | 0.1% |
| 02/15/2012 | 120 | 0.1% |
| Other values (5139) | 46032 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 307188 | |
| 2 | 139481 | |
| 1 | 125134 | |
| / | 93828 | 9.1% |
| 93828 | 9.1% | |
| : | 93828 | 9.1% |
| M | 46914 | 4.5% |
| A | 46905 | 4.5% |
| 7 | 15264 | 1.5% |
| 6 | 14609 | 1.4% |
| Other values (6) | 55129 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 656796 | |
| Other Punctuation | 187656 | 18.2% |
| Space Separator | 93828 | 9.1% |
| Uppercase Letter | 93828 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 307188 | |
| 2 | 139481 | |
| 1 | 125134 | |
| 7 | 15264 | 2.3% |
| 6 | 14609 | 2.2% |
| 8 | 12604 | 1.9% |
| 3 | 12491 | 1.9% |
| 9 | 11513 | 1.8% |
| 5 | 10865 | 1.7% |
| 4 | 7647 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 46914 | |
| A | 46905 | |
| P | 9 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 93828 | |
| : | 93828 |
Space Separator
| Value | Count | Frequency (%) |
| 93828 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 938280 | |
| Latin | 93828 | 9.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 307188 | |
| 2 | 139481 | |
| 1 | 125134 | |
| / | 93828 | 10.0% |
| 93828 | 10.0% | |
| : | 93828 | 10.0% |
| 7 | 15264 | 1.6% |
| 6 | 14609 | 1.6% |
| 8 | 12604 | 1.3% |
| 3 | 12491 | 1.3% |
| Other values (3) | 30025 | 3.2% |
Latin
| Value | Count | Frequency (%) |
| M | 46914 | |
| A | 46905 | |
| P | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1032108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 307188 | |
| 2 | 139481 | |
| 1 | 125134 | |
| / | 93828 | 9.1% |
| 93828 | 9.1% | |
| : | 93828 | 9.1% |
| M | 46914 | 4.5% |
| A | 46905 | 4.5% |
| 7 | 15264 | 1.5% |
| 6 | 14609 | 1.4% |
| Other values (6) | 55129 | 5.3% |
| Distinct | 5198 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 17763 |
| Missing (%) | 27.9% |
| Memory size | 496.8 KiB |
| 02/15/2007 12:00:00 AM | 395 |
|---|---|
| 02/20/2007 12:00:00 AM | 376 |
| 02/18/2022 12:00:00 AM | 355 |
| 02/16/2007 12:00:00 AM | 351 |
| 02/14/2007 12:00:00 AM | 293 |
| Other values (5193) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 1007930 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 773 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | 02/12/2012 12:00:00 AM |
|---|---|
| 2nd row | 10/25/2012 12:00:00 AM |
| 3rd row | 03/15/2012 12:00:00 AM |
| 4th row | 08/17/2012 12:00:00 AM |
| 5th row | 10/28/2011 12:00:00 AM |
Common Values
| Value | Count | Frequency (%) |
| 02/15/2007 12:00:00 AM | 395 | 0.6% |
| 02/20/2007 12:00:00 AM | 376 | 0.6% |
| 02/18/2022 12:00:00 AM | 355 | 0.6% |
| 02/16/2007 12:00:00 AM | 351 | 0.6% |
| 02/14/2007 12:00:00 AM | 293 | 0.5% |
| 02/21/2022 12:00:00 AM | 286 | 0.4% |
| 02/12/2007 12:00:00 AM | 255 | 0.4% |
| 02/13/2007 12:00:00 AM | 225 | 0.4% |
| 02/16/2012 12:00:00 AM | 206 | 0.3% |
| 08/16/2012 12:00:00 AM | 198 | 0.3% |
| Other values (5188) | 42875 | |
| (Missing) | 17763 |
Length
| Value | Count | Frequency (%) |
| am | 35735 | |
| 12:00:00 | 35506 | |
| pm | 10080 | 7.3% |
| 07:00:00 | 5200 | 3.8% |
| 08:00:00 | 4874 | 3.5% |
| 02/15/2007 | 395 | 0.3% |
| 02/20/2007 | 376 | 0.3% |
| 02/18/2022 | 355 | 0.3% |
| 02/16/2007 | 351 | 0.3% |
| 02/14/2007 | 293 | 0.2% |
| Other values (5066) | 44280 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 310968 | |
| 2 | 129244 | |
| 1 | 108061 | 10.7% |
| / | 91630 | 9.1% |
| 91630 | 9.1% | |
| : | 91630 | 9.1% |
| M | 45815 | 4.5% |
| A | 35735 | 3.5% |
| 7 | 22286 | 2.2% |
| 8 | 17403 | 1.7% |
| Other values (6) | 63528 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 641410 | |
| Other Punctuation | 183260 | 18.2% |
| Space Separator | 91630 | 9.1% |
| Uppercase Letter | 91630 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 310968 | |
| 2 | 129244 | |
| 1 | 108061 | 16.8% |
| 7 | 22286 | 3.5% |
| 8 | 17403 | 2.7% |
| 6 | 12465 | 1.9% |
| 3 | 12413 | 1.9% |
| 9 | 11131 | 1.7% |
| 5 | 10081 | 1.6% |
| 4 | 7358 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 45815 | |
| A | 35735 | |
| P | 10080 | 11.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 91630 | |
| : | 91630 |
Space Separator
| Value | Count | Frequency (%) |
| 91630 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 916300 | |
| Latin | 91630 | 9.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 310968 | |
| 2 | 129244 | |
| 1 | 108061 | 11.8% |
| / | 91630 | 10.0% |
| 91630 | 10.0% | |
| : | 91630 | 10.0% |
| 7 | 22286 | 2.4% |
| 8 | 17403 | 1.9% |
| 6 | 12465 | 1.4% |
| 3 | 12413 | 1.4% |
| Other values (3) | 28570 | 3.1% |
Latin
| Value | Count | Frequency (%) |
| M | 45815 | |
| A | 35735 | |
| P | 10080 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1007930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 310968 | |
| 2 | 129244 | |
| 1 | 108061 | 10.7% |
| / | 91630 | 9.1% |
| 91630 | 9.1% | |
| : | 91630 | 9.1% |
| M | 45815 | 4.5% |
| A | 35735 | 3.5% |
| 7 | 22286 | 2.2% |
| 8 | 17403 | 1.7% |
| Other values (6) | 63528 | 6.3% |
LATE_FILING_AMT
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 673 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 1385 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8236.741273 |
| Minimum | 0 |
|---|---|
| Maximum | 157500 |
| Zeros | 19300 |
| Zeros (%) | 30.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 496.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4000 |
| Q3 | 9250 |
| 95-th percentile | 36000 |
| Maximum | 157500 |
| Range | 157500 |
| Interquartile range (IQR) | 9250 |
Descriptive statistics
| Standard deviation | 14338.29869 |
|---|---|
| Coefficient of variation (CV) | 1.740773228 |
| Kurtosis | 25.29311766 |
| Mean | 8236.741273 |
| Median Absolute Deviation (MAD) | 4000 |
| Skewness | 4.058457769 |
| Sum | 512267650 |
| Variance | 205586809.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19300 | |
| 4000 | 6828 | 10.7% |
| 3000 | 905 | 1.4% |
| 250 | 844 | 1.3% |
| 7250 | 731 | 1.1% |
| 6000 | 664 | 1.0% |
| 1000 | 624 | 1.0% |
| 1500 | 619 | 1.0% |
| 31000 | 569 | 0.9% |
| 4250 | 545 | 0.9% |
| Other values (663) | 30564 | |
| (Missing) | 1385 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 19300 | |
| 150 | 368 | 0.6% |
| 250 | 844 | 1.3% |
| 300 | 124 | 0.2% |
| 400 | 91 | 0.1% |
| 450 | 149 | 0.2% |
| 500 | 491 | 0.8% |
| 600 | 86 | 0.1% |
| 650 | 37 | 0.1% |
| 700 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 157500 | 32 | |
| 149000 | 24 | |
| 144500 | 25 | |
| 141500 | 43 | |
| 134000 | 5 | < 0.1% |
| 117500 | 4 | < 0.1% |
| 116500 | 2 | < 0.1% |
| 102500 | 5 | < 0.1% |
| 101000 | 4 | < 0.1% |
| 99500 | 9 | < 0.1% |
FAILURE_TO_FILE_AMT
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1379 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2119.037284 |
| Minimum | 0 |
|---|---|
| Maximum | 39000 |
| Zeros | 41113 |
| Zeros (%) | 64.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 496.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1000 |
| 95-th percentile | 15000 |
| Maximum | 39000 |
| Range | 39000 |
| Interquartile range (IQR) | 1000 |
Descriptive statistics
| Standard deviation | 5222.800008 |
|---|---|
| Coefficient of variation (CV) | 2.46470416 |
| Kurtosis | 16.57271014 |
| Mean | 2119.037284 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.701956916 |
| Sum | 131802000 |
| Variance | 27277639.92 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 41113 | |
| 1000 | 6042 | 9.5% |
| 2000 | 4233 | 6.7% |
| 5000 | 1629 | 2.6% |
| 3000 | 1480 | 2.3% |
| 16000 | 1048 | 1.6% |
| 15000 | 1000 | 1.6% |
| 17000 | 961 | 1.5% |
| 6000 | 880 | 1.4% |
| 4000 | 833 | 1.3% |
| Other values (21) | 2980 | 4.7% |
| (Missing) | 1379 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 41113 | |
| 1000 | 6042 | 9.5% |
| 2000 | 4233 | 6.7% |
| 3000 | 1480 | 2.3% |
| 4000 | 833 | 1.3% |
| 5000 | 1629 | 2.6% |
| 6000 | 880 | 1.4% |
| 7000 | 516 | 0.8% |
| 8000 | 180 | 0.3% |
| 9000 | 138 | 0.2% |
| Value | Count | Frequency (%) |
| 39000 | 196 | |
| 37000 | 3 | < 0.1% |
| 36000 | 173 | |
| 33000 | 107 | |
| 29000 | 80 | |
| 28000 | 21 | < 0.1% |
| 26000 | 15 | < 0.1% |
| 23000 | 23 | < 0.1% |
| 22000 | 2 | < 0.1% |
| 21000 | 5 | < 0.1% |
| Distinct | 477 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 1200 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4051.667575 |
| Minimum | 0 |
|---|---|
| Maximum | 1048000 |
| Zeros | 51488 |
| Zeros (%) | 81.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 496.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 24000 |
| Maximum | 1048000 |
| Range | 1048000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 18732.42146 |
|---|---|
| Coefficient of variation (CV) | 4.623385584 |
| Kurtosis | 954.9867386 |
| Mean | 4051.667575 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.51379133 |
| Sum | 252734920 |
| Variance | 350903613.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 51488 | |
| 1000 | 1361 | 2.1% |
| 2000 | 754 | 1.2% |
| 3000 | 621 | 1.0% |
| 5000 | 573 | 0.9% |
| 4000 | 514 | 0.8% |
| 6000 | 462 | 0.7% |
| 8000 | 357 | 0.6% |
| 9000 | 342 | 0.5% |
| 7000 | 332 | 0.5% |
| Other values (467) | 5574 | 8.8% |
| (Missing) | 1200 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 51488 | |
| 1000 | 1361 | 2.1% |
| 2000 | 754 | 1.2% |
| 3000 | 621 | 1.0% |
| 4000 | 514 | 0.8% |
| 5000 | 573 | 0.9% |
| 6000 | 462 | 0.7% |
| 7000 | 332 | 0.5% |
| 8000 | 357 | 0.6% |
| 9000 | 342 | 0.5% |
| Value | Count | Frequency (%) |
| 1048000 | 6 | |
| 294900 | 3 | |
| 262000 | 4 | |
| 231300 | 6 | |
| 211600 | 3 | |
| 207600 | 3 | |
| 205800 | 3 | |
| 204600 | 3 | |
| 202940 | 4 | |
| 199200 | 4 |
| Distinct | 9038 |
|---|---|
| Distinct (%) | 50.7% |
| Missing | 45747 |
| Missing (%) | 72.0% |
| Memory size | 496.8 KiB |
| N.Y.C.H.A | 447 |
|---|---|
| RESUBMISSION | 315 |
| ALTERNATIVE PROGRAM | 243 |
| DATA ENTERED BY RT | 197 |
| NEW YORK CITY HOUSING AUTHORITY | 176 |
| Other values (9033) |
Length
| Max length | 102 |
|---|---|
| Median length | 88 |
| Mean length | 60.11339801 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1071882 |
|---|---|
| Distinct characters | 88 |
| Distinct categories | 13 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 5568 ? |
|---|---|
| Unique (%) | 31.2% |
Sample
| 1st row | PHILIP DEANS - OWNER - PHN# 212-673-6262EMAIL: PDEANS@CABRINIELDERCARE.ORG |
|---|---|
| 2nd row | REPORT FILED 6/30/15 WAS REJECTED |
| 3rd row | REPORT FILED 6/30/15 WAS REJECTED |
| 4th row | INITIAL REPORT FILED 6/30/15 WAS REJECTED |
| 5th row | INITIAL REPORT FILED 02/08/2016 WAS REJECTEDGAIL WEINER - ASSISTANT SECRETARY - PHN# 212-753-3381EMA |
Common Values
| Value | Count | Frequency (%) |
| N.Y.C.H.A | 447 | 0.7% |
| RESUBMISSION | 315 | 0.5% |
| ALTERNATIVE PROGRAM | 243 | 0.4% |
| DATA ENTERED BY RT | 197 | 0.3% |
| NEW YORK CITY HOUSING AUTHORITY | 176 | 0.3% |
| CITY OWNED | 150 | 0.2% |
| CITY OWNED NO PENALTY | 122 | 0.2% |
| AMENDED FILING: OWNER: MARTHA BRAZOBAN | 120 | 0.2% |
| ADDED TO CYCLE 6 | 41 | 0.1% |
| SUBSEQUENT | 38 | 0.1% |
| Other values (9028) | 15982 | 25.1% |
| (Missing) | 45747 |
Length
| Value | Count | Frequency (%) |
| 5877 | 3.9% | |
| to | 4204 | 2.8% |
| on | 3646 | 2.4% |
| penalty | 2730 | 1.8% |
| filing | 2585 | 1.7% |
| report | 2576 | 1.7% |
| civil | 2382 | 1.6% |
| penalties | 2364 | 1.6% |
| stopped | 2326 | 1.5% |
| and | 2296 | 1.5% |
| Other values (14229) | 121481 |
Most occurring characters
| Value | Count | Frequency (%) |
| 135020 | 12.6% | |
| E | 71560 | 6.7% |
| I | 49344 | 4.6% |
| A | 47969 | 4.5% |
| T | 46728 | 4.4% |
| N | 41782 | 3.9% |
| R | 40209 | 3.8% |
| S | 34521 | 3.2% |
| O | 32768 | 3.1% |
| 1 | 31857 | 3.0% |
| Other values (78) | 540124 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 595342 | |
| Decimal Number | 152367 | 14.2% |
| Space Separator | 135020 | 12.6% |
| Lowercase Letter | 108321 | 10.1% |
| Other Punctuation | 55445 | 5.2% |
| Dash Punctuation | 12574 | 1.2% |
| Open Punctuation | 6365 | 0.6% |
| Close Punctuation | 6116 | 0.6% |
| Currency Symbol | 277 | < 0.1% |
| Math Symbol | 38 | < 0.1% |
| Other values (3) | 17 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 71560 | |
| I | 49344 | 8.3% |
| A | 47969 | 8.1% |
| T | 46728 | 7.8% |
| N | 41782 | 7.0% |
| R | 40209 | 6.8% |
| S | 34521 | 5.8% |
| O | 32768 | 5.5% |
| D | 29776 | 5.0% |
| L | 28542 | 4.8% |
| Other values (16) | 172143 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 13178 | |
| n | 12085 | |
| t | 11732 | |
| i | 10810 | |
| a | 9857 | |
| l | 8986 | |
| o | 8104 | |
| d | 7656 | |
| p | 5352 | 4.9% |
| s | 4843 | 4.5% |
| Other values (16) | 15718 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 26065 | |
| . | 10003 | 18.0% |
| : | 8425 | 15.2% |
| # | 3686 | 6.6% |
| , | 3512 | 6.3% |
| @ | 2386 | 4.3% |
| & | 1169 | 2.1% |
| ' | 160 | 0.3% |
| ; | 28 | 0.1% |
| ? | 5 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 31857 | |
| 2 | 26443 | |
| 0 | 26401 | |
| 6 | 16103 | |
| 5 | 11076 | 7.3% |
| 8 | 8854 | 5.8% |
| 3 | 8640 | 5.7% |
| 4 | 7984 | 5.2% |
| 9 | 7733 | 5.1% |
| 7 | 7276 | 4.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 31 | |
| > | 3 | 7.9% |
| < | 3 | 7.9% |
| = | 1 | 2.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6362 | |
| { | 3 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6113 | |
| } | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 135020 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12574 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 277 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 10 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 4 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 703663 | |
| Common | 368219 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 71560 | 10.2% |
| I | 49344 | 7.0% |
| A | 47969 | 6.8% |
| T | 46728 | 6.6% |
| N | 41782 | 5.9% |
| R | 40209 | 5.7% |
| S | 34521 | 4.9% |
| O | 32768 | 4.7% |
| D | 29776 | 4.2% |
| L | 28542 | 4.1% |
| Other values (42) | 280464 |
Common
| Value | Count | Frequency (%) |
| 135020 | ||
| 1 | 31857 | 8.7% |
| 2 | 26443 | 7.2% |
| 0 | 26401 | 7.2% |
| / | 26065 | 7.1% |
| 6 | 16103 | 4.4% |
| - | 12574 | 3.4% |
| 5 | 11076 | 3.0% |
| . | 10003 | 2.7% |
| 8 | 8854 | 2.4% |
| Other values (26) | 63823 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1071870 | |
| None | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 135020 | 12.6% | |
| E | 71560 | 6.7% |
| I | 49344 | 4.6% |
| A | 47969 | 4.5% |
| T | 46728 | 4.4% |
| N | 41782 | 3.9% |
| R | 40209 | 3.8% |
| S | 34521 | 3.2% |
| O | 32768 | 3.1% |
| 1 | 31857 | 3.0% |
| Other values (75) | 540112 |
None
| Value | Count | Frequency (%) |
| ¿ | 4 | |
| ½ | 4 | |
| ï | 4 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| TR6_NO | CONTROL_NO | FILING_TYPE | CYCLE | BIN | HOUSE_NO | STREET_NAME | BOROUGH | BLOCK | LOT | SEQUENCE_NO | SUBMITTED_ON | CURRENT_STATUS | QEWI_NAME | QEWI_BUS_NAME | QEWI_BUS_STREET_NAME | QEWI_CITY | QEWI_STATE | QEWI_ZIP | QEWI_NYS_LIC_NO | OWNER_NAME | OWNER_BUS_NAME | OWNER_BUS_STREET_NAME | OWNER_CITY | OWNER_ZIP | OWNER_STATE | FILING_DATE | FILING_STATUS | PRIOR_CYCLE_FILING_DATE | PRIOR_STATUS | FIELD_INSPECTION_COMPLETED_DATE | QEWI_SIGNED_DATE | LATE_FILING_AMT | FAILURE_TO_FILE_AMT | FAILURE_TO_COLLECT_AMT | COMMENTS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | TR6-913448-9A-N1 | 913448 | Auto-Generated | 9 | 4114712.0 | 143-45 | SANFORD AVENUE | QUEENS | 5049 | 38 | 1.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 11750.0 | 1000.0 | 0.0 | NaN |
| 1 | TR6-913451-9A-N1 | 913451 | Auto-Generated | 9 | 3393807.0 | 15 | OLIVER STREET | BROOKLYN | 6099 | 1 | 2.0 | NaN | UNSAFE | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 63400.0 | NaN |
| 2 | TR6-913456-9A-N1 | 913456 | Auto-Generated | 9 | 1077623.0 | 180 | ELDRIDGE STREET | MANHATTAN | 415 | 12 | 2.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 4250.0 | 0.0 | 0.0 | NaN |
| 3 | TR6-913458-9A-N1 | 913458 | Auto-Generated | 9 | 4001141.0 | 41-46 | 50 STREET | QUEENS | 134 | 1 | 1.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 13250.0 | 2000.0 | 1000.0 | NaN |
| 4 | TR6-913460-9A-N1 | 913460 | Auto-Generated | 9 | 1088779.0 | 220 | EAST 19 STREET | MANHATTAN | 899 | 46 | 1.0 | NaN | SAFE | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 500.0 | 0.0 | 0.0 | PHILIP DEANS - OWNER - PHN# 212-673-6262EMAIL: PDEANS@CABRINIELDERCARE.ORG |
| 5 | TR6-913471-9A-N1 | 913471 | Auto-Generated | 9 | 1030341.0 | 100 | AMSTERDAM AVENUE | MANHATTAN | 1156 | 30 | 1.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 4000.0 | 0.0 | 0.0 | NaN |
| 6 | TR6-913472-9A-N1 | 913472 | Auto-Generated | 9 | 1018503.0 | 160 | EAST 34 STREET | MANHATTAN | 889 | 50 | 1.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 4750.0 | 0.0 | 0.0 | NaN |
| 7 | TR6-913473-9A-N1 | 913473 | Auto-Generated | 9 | 1087286.0 | 300 | WEST 135 STREET | MANHATTAN | 1959 | 7501 | 1.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 11500.0 | 2000.0 | 20000.0 | NaN |
| 8 | TR6-913479-9A-N1 | 913479 | Auto-Generated | 9 | 4223678.0 | 190-05 | HILLSIDE AVENUE | QUEENS | 10499 | 75 | 1.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 9750.0 | 1000.0 | 0.0 | NaN |
| 9 | TR6-913480-9A-N1 | 913480 | Auto-Generated | 9 | 3337151.0 | 181 | 73 STREET | BROOKLYN | 5906 | 18 | 1.0 | NaN | No Report Filed | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | No Report Filed | NaN | NaN | NaN | NaN | 27500.0 | 1000.0 | 0.0 | NaN |
Last rows
| TR6_NO | CONTROL_NO | FILING_TYPE | CYCLE | BIN | HOUSE_NO | STREET_NAME | BOROUGH | BLOCK | LOT | SEQUENCE_NO | SUBMITTED_ON | CURRENT_STATUS | QEWI_NAME | QEWI_BUS_NAME | QEWI_BUS_STREET_NAME | QEWI_CITY | QEWI_STATE | QEWI_ZIP | QEWI_NYS_LIC_NO | OWNER_NAME | OWNER_BUS_NAME | OWNER_BUS_STREET_NAME | OWNER_CITY | OWNER_ZIP | OWNER_STATE | FILING_DATE | FILING_STATUS | PRIOR_CYCLE_FILING_DATE | PRIOR_STATUS | FIELD_INSPECTION_COMPLETED_DATE | QEWI_SIGNED_DATE | LATE_FILING_AMT | FAILURE_TO_FILE_AMT | FAILURE_TO_COLLECT_AMT | COMMENTS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 63568 | TR6-816512-8C-I1 | 816512 | Initial | 8 | 1088616.0 | 16 | WEST 21 STREET | MANHATTAN | 822 | 7505 | NaN | 2017-12-18 00:00:00 | SAFE | BARIS ACAR | PACE ENGINEERING P.C. | 183 MADISON AVENUE | NEW YORK | NY | 10016 | PE - 088950 | ALEX KIRK | PR | NaN | NaN | NaN | NaN | 12/18/2017 12:00:00 AM | SAFE | NaN | NaN | 10/25/2017 12:00:00 AM | 09/23/2017 08:00:00 PM | 0.0 | 0.0 | 0.0 | NaN |
| 63569 | TR6-816565-8B-I1 | 816565 | Initial | 8 | 3327373.0 | 725 | CHURCH AVENUE | BROOKLYN | 5330 | 24 | NaN | 2018-02-20 00:00:00 | SAFE | ANDREW KATZ | ANDREW KATZ ENGINEERS | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | LAWRENCE BERNSTEIN | JONAS EQUITIES, INC. | NaN | NaN | NaN | NaN | 10/05/2018 12:00:00 AM | SAFE | NaN | NaN | 02/14/2018 12:00:00 AM | 06/30/2018 08:00:00 PM | 2000.0 | 0.0 | 0.0 | NaN |
| 63570 | TR6-916956-9A-I1 | 916956 | Initial | 9 | 3396957.0 | 185 | OCEAN AVENUE | BROOKLYN | 5026 | 7501 | NaN | 2020-03-12 00:00:00 | SAFE | ANDREW KATZ | ANDREW KATZ ENGINEERS | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | JOSH SHINE | PR | NaN | NaN | NaN | NaN | 03/12/2020 12:00:00 AM | SAFE | NaN | NaN | 04/02/2020 12:00:00 AM | 03/30/2020 08:00:00 PM | 0.0 | 0.0 | 0.0 | NaN |
| 63571 | TR6-806107-8B-I1 | 806107 | Initial | 8 | 1052803.0 | 321 | EAST 108 STREET | MANHATTAN | 1680 | 13 | 1.0 | 2016-11-22 00:00:00 | SAFE | ANDREW KATZ | NaN | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | ROBERT GORDON | AJ Clarke Real Estate Corp. | NaN | NaN | NaN | NaN | 11/22/2016 12:00:00 AM | SAFE | 07/12/2011 12:00:00 AM | SWARMP | 09/28/2016 12:00:00 AM | 11/11/2016 12:00:00 AM | 10750.0 | 0.0 | 0.0 | INITIAL REPORT FILED 5/25/16 WAS REJECTED |
| 63572 | TR6-801841-8C-I1 | 801841 | Initial | 8 | 1015054.0 | 154 | WEST 27 STREET | MANHATTAN | 802 | 71 | 1.0 | 2017-11-29 00:00:00 | SWARMP | ANDREW KATZ | ANDREW KATZ ENGINEERS | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | ISAAC SCHWARTZ | West End Estates LLC | NaN | NaN | NaN | NaN | 12/06/2018 12:00:00 AM | SWARMP | 05/11/2007 12:00:00 AM | SAFE | 12/03/2018 12:00:00 AM | 11/26/2018 07:00:00 PM | 17950.0 | 5000.0 | 0.0 | NaN |
| 63573 | TR6-801722-8B-I1 | 801722 | Initial | 8 | 1014471.0 | 576 | 8 AVENUE | MANHATTAN | 788 | 4 | 1.0 | 2016-12-22 00:00:00 | SWARMP | ANDREW KATZ | NaN | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | STEVEN GREEN | 580 8th Ave Realty | NaN | NaN | NaN | NaN | 12/22/2016 12:00:00 AM | SWARMP | 05/10/2012 12:00:00 AM | SAFE | 10/28/2016 12:00:00 AM | 12/14/2016 12:00:00 AM | 3750.0 | 0.0 | 0.0 | NaN |
| 63574 | TR6-814017-8B-I1 | 814017 | Initial | 8 | 2114714.0 | 1514 | SEDGWICK AVENUE | BRONX | 2880 | 9 | 1.0 | 2019-06-11 00:00:00 | SAFE | ANDREW KATZ | ANDREW KATZ ENGINEERS | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | NOEMI MARTINEZ | SEDGWICK RIVERVIEW, L.P. | NaN | NaN | NaN | NaN | 06/11/2019 12:00:00 AM | SAFE | NaN | NaN | 06/05/2019 12:00:00 AM | 07/22/2019 08:00:00 PM | 4000.0 | 1000.0 | 0.0 | NaN |
| 63575 | TR6-814521-8A-I1 | 814521 | Initial | 8 | 3017631.0 | 266 | 22 STREET | BROOKLYN | 899 | 22 | 1.0 | 2019-01-09 00:00:00 | SWARMP | ANDREW KATZ | ANDREW KATZ ENGINEERS | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | JACK LOCICERO | SOUTH SLOPE REALTY OF BROOKLYN, INC | NaN | NaN | NaN | NaN | 01/09/2019 12:00:00 AM | SWARMP | NaN | NaN | 12/14/2018 12:00:00 AM | 12/06/2018 07:00:00 PM | 9750.0 | 1000.0 | 0.0 | ADDED TO FISP UNIVERSE 8/11/2015 (CAW)FINAL C.O. 07/27/2004 |
| 63576 | TR6-810327-8C-I1 | 810327 | Initial | 8 | 3201014.0 | 2675 | OCEAN AVENUE | BROOKLYN | 7381 | 79 | 1.0 | 2018-10-31 00:00:00 | SWARMP | ANDREW KATZ | ANDREW KATZ ENGINEERS | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | ELIE GABAY | Ocean Road Terrace Coop Apts Inc | NaN | NaN | NaN | NaN | 10/31/2018 12:00:00 AM | SWARMP | 05/08/2013 12:00:00 AM | SWARMP | 10/23/2018 12:00:00 AM | 12/28/2018 07:00:00 PM | 1800.0 | 0.0 | 0.0 | NaN |
| 63577 | TR6-810947-8C-I1 | 810947 | Initial | 8 | 4440232.0 | 99-60 | 63 ROAD | QUEENS | 2111 | 7501 | 3.0 | 2020-01-16 00:00:00 | No Report Filed | ANDREW KATZ | ANDREW KATZ ENGINEERS | 3452 BEDFORD AVE | BROOKLYN | NY | 11210 | PE - 051094 | ISAK RADONCIC | COMPREHENSIVE DESIGNS | NaN | NaN | NaN | NaN | NaN | No Report Filed | 01/12/2007 12:00:00 AM | SAFE | 08/24/2020 12:00:00 AM | 08/27/2020 08:00:00 PM | 80000.0 | 36000.0 | 0.0 | NaN |